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Novel Compounds 

FIELD OF THE INVENTION 

This invention relates to polyniKjleotides, (herein referred to as '*BASB232 
polynucleotide(s)"), polypeptides encoded by them (referred to herein as "BASB232" or 
"BASB232 polypeptide(s)"), recombinant materials and methods for their production. In 
particxilar, the invention relates to immunogenic compositions and vaccines containing 
single polypeptide or nucleotides or advantageous combinations thereof. In another aspect of 
the invention, the invention relates to methods for using such polypeptides and 
polynucleotides for the treatment or prevention of Bordetella infections. In a further aspect, 
the invention relates to diagnostic assays for detecting Bordetella infection. 

BACKGROUND OF THE INVENTION 

The bacterium Bordetella pertussis is the causative agent for whooping coug^ a 
respiratory disease that can be severe in infants and young children. The clinical course 
of the disease is characterised by paroxysms of rapid coughs followed by inspiratory 
effort, often associated with a characteristic *whooping' soimd. In serious cases, oxygen 
deprivation can lead to brain damage, however the most common complication is 
secondary pneumonia. 

Whooping cough is usually considered to be caused by 5. pertussis^ but occasionally B, 
parapertussis is isolated Jfrom patients with typical signs and syn^toms of whooping 
cough. B. parapertussis infection is of lower frequency than B. pertussis witih 5-10% of 
whooping cough being associated with B. parapertussis (Mertsola (1985) Eur J Clin 
Microbiol 4; 123; Lautrop (1971) Lancet 1(771 1) 1 195-1 198), B. parapertussis is 
associated with mild clinical symptoms which, combined with its serological cross- 
reactivity with B.pertussis, makes B. parapertussis difBcult to diagnose. 

The first generation of vaccines against B, pertussis were whole cell vaccines, composed 
of whole killed bacteria. These were introduced in many countries in the 1950s and 
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1960s and were successful at reducing the incidence of whooping cough. A problem 
with \;^^ole cell B. pertussis vaccines is the high level of reactogenicity associated with 
them. Acellular vaccines containing purified B. pertussis proteins are less reactogenic 
and have been adopted for the vaccination programmes of many countries. Acellular 
vaccines typically containing pertussis toxin (PT), filamentous haemagglutinin (FHA) 
and quite often pertactin (PRN)> widely used and provide effective protection firom 
the severity of whooping cough. 

Despite vaccination, whooping cough remains an endemic disease (Mooi et al (2001) 
Emerging Infectious Diseases 7; 526). Whooping cough has re-emerged in Australia, 
Canada and The Netherlands; coimtries with highly vaccinated populations. A 
comparison of pre-vaccination strains with strains isolated recently, has shown antigenic 
drift, particularly in PT and PRN (Mooi et al (1998) Infection and Immunity 66; 670). It 
is widely acknowledged that current vaccines protect against severe disease but do not 
eliminate Bordetella pertussis from die body (Cherry et al (1998) Vaccine 16; 1901, 
Hewlett and Halperin (1998) Vaccine 16; 1899, Storsaeter et al (1998) Vaccine 16; 
1907). The defence mechanisms of Bordetella pertussis allow it to evade elimination 
firom the body, indicating that current vaccines do not conopletely disable these defence 
mechanisms. 

Vaccination using whole cell B. pertussis vaccines (Pw), appears to protect against B, 
parapertussis infection, probably due to the similarity of the two bacteria. B. 
parapertussis infection in unvaccinated infants may lead to severe and fatal 
complications, whereas in indiAdduals vaccinated with Pw, a milder, often subclinical 
course of whooping cough is seen (Long et al (1990) Pediatric Infect Dis J 9; 700). 
Theoretically, the introduction of acellular pertussis vaccines containing only two or 
three-purified protdns could reduce the ability of vaccination to protect against B. 
parapertussis. 
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Accordingly, further iicproved acellular vaccines against whooping cough are required 
that combine low reactogenicity with an ability to elicit a protective response against 
Bordetella, particularly both B, pertussis and B. parapertussis, infection. The 
identification of new candidate antigens and particularly effective combinations of 
antigens will allow the development of such vaccines. 

SUMMARY OF THE INVENTION 

The present invention relates to immxmogenic compositions containing BASB232, in 
particular BASB232 polypeptides or BASB232 polynucleotides, recombinant materials and 
methods for their production. In a further aspect, the invention relates to combination of 
polypeptides or nucleotides that interact advantageously in the prevention or treatment of 
microbial, particularly Bordetella, disease. In another aspect, the invention relates to 
methods for using such polypeptides, polynucleotides and combinations, including 
prevention and treatment of Bordetella diseases, amongst others. In a further aspect, the 
invention relates to diagnostic assa;^ for detecting diseases associated with microbial 
infections and conditions associated with such infections, such as assays for detecting 
expression or activity of BASB232 polynucleotides or polypeptides. 

Various changes and modifications within the spirit and scope of the disclosed invention 
will become readily apparent to those skilled in the art fi^om reading the following 
descriptions and from reading the other parts of the present disclosure. 

DESCRIFnON OF FIGURES 

Figure 1 — is a graph showing protection against challenge with B, pertussis strain 
Tohama in groups of mice pre-immimised with carrier DT BrkA, DTPa-2, DTPa-2 
BrkA, DTPa-3 or DTPa-3 BrkA. Results are expressed as the number of CPU isolated 
per lung at different time points after challenge. Pa-2 is a combination of pertussis toxin 
and FHA, whilst Pa-3 is a combination of p^tussis toxin, FHA and pertactin. 
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Figure 2 — is a graph showing protection against challenge wifh B, pertussis strain 
18323 in groiq)S of mice pre-immunised with carrier DT BrkA, DTPa-2, DTPa-2 BrkA, 
DTPa-3 or DTPa-3 BrkA. Results are expressed as the nmnber of CFU isolated per lung 
at different time points after challenge. 

Figure 3 — graphs showiog protection against challenge with B. pertussis or B, 
parapertussis in groups of mice preimmunised with DTPw or DTPa from several 
sources. Results are expressed as number of CFU isolated from the lung at different 
time points after challenge. 

Figure 4 — graphs showing protection against B. parapertussis by antibodies against 
whole cell B. pertussis. 

Panel A shows active protection against B, parapertussis following inmiunization of 
mice with DTPw vaccine. 

Panel B shows passive protection against ^B. parapertussis by antisera raised against 
DTPw. 

Figure 5 - Alignment of the SDS-PAGE ofB. parapertussis OMP and the 
corresponding Westem blot treated with sera raised against B. pertussis (DTPw). The 
running buffer used for these gels was MES. Alignment was performed using prestained 
molecular weight standards as reference points. 

Figure 6 - Aligmnent of the SDS-PAGE of A parapertussis OMP and the 
correi^onding Westem blot treated with sera raised against B. pertussis (DTPw). The 
running buffer used for these gels was MOPS. Alignment was performed using 
prestained molecular weight standards as reference points. 

DESCIOFTION OF THE INVENTION 

The ruvention relates to BASB232 polypeptides and polynucleotides as described in 
greater detail below. In particular, the invention relates to polypeptides and 
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polynucleotides of BASB232 of B. pertussis, particularly comprised ia immuaogenic 
conipositions. 

The inveation relates especially to BASB232 polynucleotides and encoded polypeptides 
listed in table 1. Hiose polynucleotides and encoded polypeptides have the nucleotide and 
amino acid sequences set out in SEQ ID NO: 1 to SEQ ID NO: 1 10 as described in table 1 , 
Table 1 



Name 


Length 
(nT) 


Length 
(aa) 


SEQ 
ID 

nucL 


SEQ 
ID 
prot. 


nescnption 


Orfl7 


3033 


1010 


33 


34 


BikA, Bordetella pertussis (81%) 


Orfl 


2211 


737 


1 


2 


Ferric enterobactin receptor (BfeA), Bordetella 
pertussis (95%) 


Orf2 


2475 


812 


3 


4 


Probable hydroxamate-type ferrisiderophore receptor 
(BfirB), Pseudomonas aeruginosa (40%) 


Or£3 


2403 


729 


5 


6 


Putative hydroxamate-type ferrisiderophore receptor 
signal peptide protein (BfrC), Pseudomonas 
solanacearum (38%) 


Orf4 


2304 


734 


7 


8 


Putative ferric siderophore receptor (FauA), Bordetella 
bronchiseptica (97%) 


OrfS 


2187 


825 


9 


10 


Unidentified ferric siderophore receptor, Bordetella 
bronchiseptica (94%) 


Oif6 


2064 


801 


11 


12 


Ferric alcaligin siderophore recq)tor, Bordetella 
pertussis (100%) 


Orf7 


2229 


743 


13 


14 


Hydroxamate-type ferrisidero-phore receptor (iron 
transport protein £[u), Pseudomonas aeruginosa (37%) 


Oif8 


2268 


756 


15 


16 


Hydroxamate-type ferrisidero-phore receptor (iron 
transport protein fiu), Pseudomonas aeruginosa (41%) 


Or© 


2106 


702 


17 


18 


Putative hydroxamate-type ferrisiderophore receptor 
signal peptide protein^ Pseudomonas solanacearum 
(40%) 


OrflO 


2610 


870 


19 


20 


BhuR, outer membrane heme receptor, Bordetella 
pertussis (100%) 


Orfll 


2280 


760 


21 


22 


Probable tonb-dependent receptor, Pseudomonas 
aeruginosa (34%) 


Orfl2 


1887 


629 


23 


24 


Probable tonb-dependent receptor, Pseudomonas 
aeruginosa (34%) 


Orfl3 


1731 


577 


25 


26 


Ferrisiderophore receptor-like protein, Pseudomonas 
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sp (57%) 


Orfl4 


1434 


478 


27 


28 


Probable toiiB-dq>endaiit lec^tor Yncd precursor, 
Eschenchia con (56%) 


Orfl5 


2730 


910 


29 


30 


Pertactin outer membrane protein, Bordetella pertussis 
(100%) 


Orfl6 


2748 


915 


31 


32 


VagS protein, Bordetella pertussis (96%) 


Orfl? 


3033 


1010 


33 


34 


BikA, Bordetella pertussis (81%) 


Orfl8 


1944 


647 


35 


36 


Tcf protem, Bordetella pertussis (74%) 


Orfl9 


1245 


418 


37 


38 


Phg protem, Bordetella pertussis (81%) 


OrfZO 


2712 


903 


39 


40 


BapA protein, Bordetella pertussis (85%) 


OrGI 


1446 


482 


41 


42 


BapB protein, Bordetella pertussis (87%) 


Orf22 


2277 


759 


43 


44 


Putative autotransporter BapC, Bordetella pertussis 
(86%) 


OtQ3 


1545 


515 


45 


46 


Pertactin-like protein, Bordetella pertussis (47%) 


Orf24 


1191 


397 


47 


48 


Tcf-like protein, Bordetella pertussis (56%) 


OrG5 


6903 


2300 


49 


50 


Bxtracellular serine protease. Brucella melitensis (25%) 


OtQ6 


2622 


873 


51 


52 


Autotiansporta protein, Agrobacterium tumefaciens 
(43%) 


Orf27 


3120 


1039 


53 


54 


Autotransporter subtilisin-like protease (SphBl), 
Bordetella pertussis (93%) 


OrfZS 


2241 


747 


55 


56 


Heme/hemopexin utilization protein c precursor, 
Haemophilus influenzae (48%) 


Or£29 


1575 


525 


57 


58 


Lipoprotein (piln protein), Escherichia coli (22%) 


OrGO 


1509 


503 


59 


60 


Inununogenic protein, Deinococcus radiodttrans (35%) 


Or£31 


1491 


497 


61 


62 


Probable outer membrane lipoprotein precursor, 
Pseudomonas aeruginosa (48%) 


OrS2 


1491 


497 


63 


64 


Probable outer membrane efflux protein precursor, 
Pseudomonas aeruginosa (43%) 


Orf33 


1380 


460 


65 


66 


Oprm, Pseudomonas aeruginosa (45%) 


Orfi4 


1347 


449 


67 


68 


Probable outer membrane channel signal peptide 
protein, Ralstonia solanacearum (40%) 


Orf35 


1287 


429 


69 


70 


Putative membrane-bound lytic murein 
transglycosylase a transmembrane protein (MltA), 
Ralstonia solanacearum (42%) 


OtQ6 


1143 


381 


71 


72 


Putative membrane-bound lytic murein 
transglycosylase b protein (MltB), Ralstonia 
solanacearum (40%) 


Orf37 


1095 


365 


73 


74 


Putative polysaccharide export protein yccz precursor, 
Escherichia coli (34%) 
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OrGS 


897 


299 


75 


76 


Putative serine protease transmembrane protein^ 
Ralstonia solanacearum (55%) 


OrG9 


852 


284 


77 


78 


Hypothetical protein pa4632, Pseudomonas aeruginosa 
(52%) 


Orf40 


846 


282 


79 


80 


Competence lipoprotein coml precursor. Neisseria 
meningitidis (45%) 


Orf41 


813 


271 


81 


82 


Probable lipoprotein precursor (vacj) transmembrane, 
Ralstonia solanacearum (43%) 


Orf42 


801 


267 


83 


84 


Putative outer membrane lipoprotein. Salmonella 
typhimurium (24%) 


Orf43 


690 


230 


85 


86 


Flagellar l-ring protein precursor (basal body l-ring 
protein), Escherichia coli (51%) 


Orf44 


678 


226 


87 


88 


Hypothetical lipoprotein ydcl precursor, Escherichia 
coli (32%) 


Orf45 


558 


186 


89 


90 


Probable peptidoglycan-associated lipoprotein 
precursor (Pal), Ralstonia solanacearum (63%) 


Orf46 


552 


184 


91 


92 


Putative outer membrane lipoprotein (OmlA), 

Bordetella pertussis (100%) 


Orf47 


546 


182 


93 


94 


Hypothetical transmembrane protein smc00354, 
JUiizobium meliloti (36%) 


Orf48 


501 


167 


95 


96 


Putative out^ membrane lipoprotein transmembrane, 
Ralstonia solanacearum (40%) 


Orf49 


456 


152 


97 


98 


Lipoprotein, Vibrio cholerae (44%) 


OrfSO 


5307 


1769 


99 


100 


Autotransporter Bordetella parapertussis (100%) 
BPP0452 


Orf 51 


579 


193 


101 


102 


OmpA Boraetelia pertussis (lUUyb) 


Orf52 


579 


193 


103 


104 


OmpA Bordetella parapertussis (100%) BPP3135 


Orf53 


2229 


743 


105 


106 


Probable TonB-dependent receptor for iron transport 
Bordetella parapertussis (100%) BPP3376 


Or£54 


1155 


385 


107 


108 


Outer membrane porin protein precursor Bordetella 
pertussis (100%) 


Orf55 


1164 


388 


109 


110 


Outer membrane porin protein precursor Bordetella 
parapertussis (100%) BPP3392 



The percentage shown in table 1 are the identity percentage shared by each sequence of the 
BASB232 polypeptides and their homologous polypeptides found in B, pertussis or in other 
organisms (by a BLAST homology search). 
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It is understood that sequences recited in the Sequence Listing below as "DNA" represent 
an exemplification of one embodiment of the invention, since those of ordinary skill will 
recognize that such sequraices can be usefully en:q)loyed in polynucleotides in general, 
including ribopolynucleotides. 

The sequences of the BASB232 polynucleotides are set out in SEQ ID NO: 33, 1, 3, 5, 7, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 
105, 107, 109. SEQ Group 1 refers herein to the group of polynucleotides set out in SEQ 
ID NO: 33, 1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 37, 39, 41, 43, 45,. 
47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 
95, 97, 99, 101, 103, 105, 107, 109. 

The sequences of the BASB232 encoded polypeptides are set out in SEQ ID NO: 34, 2, 4, 
6, 8, 10, 12, 14, 16. 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 
56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 
102, 104, 106, 108, 110. SEQ Group 2 refers herein to the groiq> of encoded polypeptides 
set out in SEQ ID NO: 34, 2, 4, 6, 8, 10, 12, 14. 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 
40, 42, 44, 46, 48, 50, 52, 54, 56, 58. 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 
88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110. 

The BASB232 polynucleotides set out in SEQ ID 1, 3, 5. 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 

27 and 105 belong to the iron transporter protein family. 

The BASB232 polynucleotides set out in SEQ ID 33, 29, 31, 35, 37, 39, 41, 43, 45, 47, 49, 
51, 53 and 99 belong to tiie autotransporter proteins family. 

The BASB232 polynucleotides set out in SEQ ID 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 
75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95 and 97 belong to the Upoproteins family. 

Tne BASB232 polypeptides set out in SEQ ID 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 

28 and 106-belong to the mm transporter protein &mily. 

The BASB232 polypeptides set out in SEQ ID 34, 30, 32, 36, 38, 40, 42, 44, 46, 48, 50, 52, 
54 and 100 belong to the autotransporter proteins family. 
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The BASB232 polypeptides set out in SEQ ID 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92, 94, 96 and 98 belong to the lipoproteins family. 

Polypeptides 

In one aspect of the invention th^e are proAdded polypeptides of B. pertussis referred to 
herein as "BASB232" and "BASB232 polypeptides" as well as biologically, diagnostically, 
prophylactically, clinically or therapeutically useful variants thereo:^ and compositions, 
preferably immunogenic compositions comprising the same. 

The present invention further provides for: 

(a) an isolated polypeptide which conq>rises an amino acid sequence which has at least 
85% identity, preferably at least 90% identity, more preferably at least 95% identity, most 
preferably at least 97, 98 or 99% or exact identity, to that of any sequence of SEQ Groiq) 
2; 

(b) a polypeptide encoded by an isolated polynucleotide comprising a polynucleotide 
sequence which has at least 85% identity, preferably at least 90% identity, more preferably 
at least 95% identity, even more preferably at least 97, 98 or 99% or exact identity to any 
sequence of SEQ Group 1 over the entire length of the selected sequence of SEQ Group 1; 
or 

(c) a polypeptide encoded by an isolated polynucleotide comprising a polynucleotide 
sequence encoding a polypeptide which has at least 85% identity, preferably at least 90% 
identity, more preferably at least 95% identity, even more preferably at least 97-99% or 
exact identity, to the amino acid sequence of any sequence of SEQ Qrovop 2. 

The BASB232 polypeptides provided in SEQ Group 2 are the BASB232 polypeptides 
from B. pertussis (or B. parapertussis) as described in table 1 . It is envisaged that B, 
parapertussis (or B. pertussis) sequences may be used. 

The invention also provides an immunogenic fragment of a BASB232 polypeptides, that 
is, a contiguous portion of the BASB232 polypeptide winch has the same or substantially 
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the same iinmuiiogemc activity as fhe polypeptide comprising the corre^onding amino 
acid sequence selected from SEQ Group 2 ; That is to say, the fragment (if necessary 
when coupled to a carrier) is capable of raising an immune response which recognises the 
BASB232 polypeptide. Such an immunogeuic fragment may include, for exanq)le, the 
BASB232 polypeptide lacking an N-tenrdnal leader sequence, and/or a transmembrane 
domatEL and/or a C-temiinal anchor domain, la a preferred aspect ttie immunogenic 
fragment of BASB232 accordiug to the invention comprises substantially aU of the 
extracellular domain of a polypeptide which has at least 85% identity, preferably at least 
90% identity, more preferably at least 95% identity, most preferably at least 97-99% 
identity, to that a sequence selected from SEQ Group 2 over the entire length of said 
sequence. 

A fragment is a polypeptide having an amino add sequence that is entirely the same as part 
but not all of any amino acid sequence of any polypeptide of the invention. As with 
BASB232 polypeptides, fragments may be "free-standing," or comprised within a larger 
polypeptide of which they form a part or region, most preferably as a sin^e continuous 
region in a single larger polypeptide. 

Preferred fragments include, for example, truncation polypeptides having a portion of an 
amino acid sequence selected from SEQ Group 2 or of variants th^eof, such as a continuous 
series of residues that includes an amino- and^or carboxjd-terminal amino acid sequence. 
Degradation forms of the polypeptides of the invention produced by or in a host cell, are 
also preferred. Further preferred are fragments characterized by structural or frinctional 
attributes such as fragments that conq)rise alpha-helix and alpha-helix forming regions, 
betarsheet and beta-sheet-forming regions, turn and turn-forming regions, coil and coil- 
forming regions, hydrophilic regions, hydrophobic regions, alpha an^Upathic regions, beta 
amphipatlri c regions, flexible regions, sur&ce-forming regions, substrate binding region, and 
high antigenic index regions. 
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Further preferred firagmeats include an isolated polypeptide conq>rismg an amino acid 
sequence having at least 15, 20, 30, 40, 50 or 100 contiguous amino acids from the amino 
acid sequence selected from SEQ Groiq) 2 or an isolated polypeptide comprising an amino 
acid sequence having at least 15, 20, 30, 40, 50 or 100 contiguous amino acids truncated 
or deleted from the amino acid sequence selected from SEQ Group 2 . 

The BASB232 polypqjtides set out in SEQ ID 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 
53 and 54 belong to an autotransporter proteins family. In this family, there are two 
domains : the passenger domain that is surface exposed and the beta domain that is anchored 
in the outer membrane protein. The passenger domain is a prefeired fragment for vaccine 
use. The passenger domain was predicted for each of the BASB232 polypeptides set out in 
SEQ ID 30, 32, 34, 36, 38, 40, 42, 44, 50, 52 and 100 in table 2. 



Table 2 



encoded peptidic 
sequence 


1" amino adds of the 
preferred fk^gment 


Last amino adds of the 
preferred fragment 


SEQIDNO:30 


35 


604 


SEQIDNO:32 


40 


614 


SEQIDNO:34 


41 


706 


SEQIDNO:36 


40 


132 


SEQIDNO:38 


36 


114 


SBQIDNO:40 


31 


595 


SEQIDNO:42 


1 


185 


SEQIDNO:44 


1 


458 


SEQIDNO:50 


38 


1984 


SEQ ID NO:52 


43 


561 


SEQ ID NO: 100 


39 


1453 
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Fragments described in table 2 are preferred fragments. These fragments may be readily 
modified by adding or removing 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40 or 50 amino acids 
from either or both of the N and C termini. 

StiU further preferred fragments are those which conqirise a B-oell or T-helper epitope, for 
exanq>le those fragments/peptides described in Example 8. 

Fragments of the polypeptides of the invention may be eirq^loyed for producing the 
corresponding full-length polypeptide by peptide synthesis; therefore, these fragments may 
be employed as intermediates for producing the full-length polypeptides of the inventioxL 

The term ''fi-agmenf encompasses the fragment itself or the fragment may be part of a 
larger protein or a fusion protein. 

Partictilarly preferred are variants in which several, 5-10, 1-5, 1-3, 1-2 or 1 amino acids 
are substituted, deleted, or added in any conibination. 

The polypeptides, or immunogenic fragments, of the invention may be in the form of the 
**mature" protein or may be a part of a larger protein such as a precursor or a fusion 
protein. It is often advantageous to include an additional amino acid sequence which 
contains secretory or leader sequences, pro-sequences, sequences which aid in 
purification such as multiple histidine residues, or an additional sequence for stability 
dxiring recombinant production. Furthermore, addition of exogenous polypeptide or 
Upid tail or polynucleotide sequences to increase the immunogenic potential of the final 
molecule is also considered. 

In one aspect the invention relates to genetically engineered soluble fusion proteins 
conq)rising a polypeptide of the present invention, or a fragment thereof, and various 
portions of the constant regions of heavy or light chains of immunoglobulins of various 
subclasses (IgG, IgM, IgA, IgJE). Preferred as an immunoglobiilin is the constant part of 
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the heavy chain of human IgG, particularly IgGl, where fiision takes place at the hinge 
region, hi a particular embodiment, the Fc part can be removed sin5)ly by incorporation 
of a cleavage sequence which can be cleaved with blood clotting factor Xa. 

Furthermore, this invention relates to processes for the preparation of these fusion 
proteins by genetic engineering, and to the use thereof for drug screening, diagnosis and 
therapy. A fiirther aspect of the invCTtion also relates to polynucleotides encoding such 
fiision proteins. Examples of fusion protein technology can be found in hitemational 
Patent Application Nos. W094/29458 and W094/22914. 

The proteins may be chemically conjugated, or expressed as recombinant fusion 
proteins allowing increased levels to be produced in an expression system as compared 
to non-fused protein. The fusion partner may assist in providing T helper epitopes 
(inmmnolo^cal fusion partner), preferably T helper epitopes recognised by humans, or 
assist in expressing the protein (expression enhancer) at higher yields than the native 
recombinant protein. Preferably the fusion partner will be both an immunological 
fusion partner and expression enhancing partner. 

Fusion partners include protein D from Haemophilus influenzae and the non-stmctural 
protein from influenza virus, NSl (hemagglutiuxin). Another fusion partner is the protein 
known as Omp26 (WO 97/01638). Another fusion partner is the protein known as 
LytA. Preferably the C terminal portion ofthe molecule is used. LytA is derived from 
Streptococcus pneumoniae which synthesize an N-acetyl-L-alanine amidase, amidase 
LytA, (coded by the lytA gene {Gene, 43 (1986) page 265-272}) an autolysin that 
specifically degrades certain bonds in the peptidoglycan backbone. The C-terminal 
domain ofthe LytA protein is responsible for the afiSnity to the choline or to some 
choline analogues such as DBAE. This property has been exploited for the development 
of E.coli C-LytA expressing plasmids useful for expression of fusion proteins. 
Purification of hybrid proteins containing the C-LytA fragment at its amino terminus 
has been described {Biotechnology: 10, (1992) page 795-798}. It is possible to use the 
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repeat portion of the LytA. molecule found in the C teiminal end starting at residue 178, 
for example residues 188 - 305. 

Hie present invention also includes variants of the aforementioned polypeptides, that is 
polypeptides that vary from the refeKsats by conservative amino acid substitutions, 
whereby a residue is substituted by anoth«ar with like characteristics. Topical such 
substitutions are among Ala, Val, Leu and He; among Ser and Thr; among the acidic 
residues Asp and Glu; among Asn and Ghi; and among the basic residues Lys and Aig; or 
aromatic residues Phe and Tyr. 

Polypeptides of the present invention can be prepared in any suitable maimer. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a 
combination of these methods. Means for preparing such polypeptides are well 
understood in the art. 

It is most preferred that a polypeptide of the invention is derived fiom B. pertussis, however, 
it is optionally obtained fixmi other organisms of flie same taxonomic genus. A polypeptide 
of the invention may also be obtained, for exaniple, fiom organisms of the same taxonomic 
femily or order (for instance Bordetetta parapertussis or Bordetella bronchoseptica). 

Polynucleotides 

It is an object of the invaition to provide polynucleotides that encode BASB232 
polypeptides, particularly polynucleotides that encode polypeptides herein designated 
-BASB232. 

hr a~p3rticula7ly preferred aiibodimenL of die invention tihie polynucleotide conqirises a 
r^on encoding BASB232 polypeptides comprising sequences set out in SBQ Gioupl 
A^ch include fidl lei^ gene, or a variant or fiagment thereof. 
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Polynucleotides of the invention do not mcon^pass a conq)lete genomic DNA from a 
Bordetella species, e.g. B. pertussis or A parapertussis. 

As a further aspect of the invention there are provided isolated nucleic acid molecules 
encoding and/or expressing BASB232 polypeptides and polynucleotides, particularly B. 
pertussis orB. parapertussis BASB232 polypq>tides and polynucleotides, including, for 
example, unprocessed RNAs, ribozyme RNAs, mKNAs, cDNAs, B- and Z-DNAs. 
Further embodiments of the invention include biologically, diagnostically, 
prophylactically, clinically or therapeutically useful polynucleotides and polypeptides, 
and variants thereof, and con:5)ositions, preferably immunogenic compositions, 
con5)rising the same. 

Another aspect of the invention relates to isolated polynucleotides, including at least one full 
length gene, that encode BASB232 polypeptides having a deduced amino acid sequence of 
SEQ Group 2 and polynucleotide closely related thereto and variants thereof 

In another particularly preferred embodiment of the invention relates to BASB232 
polypeptides from B, pertussis or B. parapertussis comprising or consisting of an amino 
acid sequence selected from SEQ Group 2 or a variant thereof 

Using the information provided herein, such as a polynucleotide sequence set out in SEQ 
Group 1, a polynucleotide of the invention encoding BASB232 polypeptide may be obtained 
using standard cloning and screening methods, such as those for cloning and sequencing 
chromosomal DNA fragments from bacteria using B. pertussis strain Tohama I cells as 
starting material, followed by obtaining a fuU length clone. For exan5)le, to obtain a 
polynucleotide sequence of the invention, such as a polynucleotide sequence given in 
SEQ Groiq) 1, typically a library of clones of chromosomal DNA of A pertussis strain 
Tohama I in E.coli or some other suitable host is probed with a radiolabeled 
oligonucleotide, preferably a 17-mer or longer, derived fit)m a partial sequence. Clones 
carrying DNA identical to that of the probe can then be distinguished using stringent 
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hybridization conditions. By sequencing the individual clones thus identified by 
hybridization with sequaidng primers designed from the original polypeptide or 
polynucleotide sequence it is then possible to extend the polynucleotide sequence in both 
directions to determine a full length gene sequence. Conveniently, such sequencing is 
performed, for example, using denatured double stranded DNA prepared firom a plasmid 
clone. Suitable techniques are described by Maniatis, T., Fritsch, E.F. and Sambrook et 
aL. MOLECULAR CLONING, A LABORATORY MANUAL, 2nd Ed.; Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989). (see in particular Screening By 
Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Tenqjlates 13.70). 
Direct genomic DNA sequencing may also be performed to obtain a full length gene 
sequence. Illusti:ative of the invention, each polynucleotide set out in SBQ Gioijp 1 was 
discovered in a DNA Ubrary derived fix)m B. pertussis or B. parapertussis. 

Moreover, each DNA sequence set out in SEQ Groip 1 contains an opan reading fiame 
encoding a protein having about the number of amino acid residues set forth in SEQ Gioiq) 
2 with a deduced molecular weight tiiat can be calculated usmg amino acid residue 
molecular weight values well known to those skilled in the art. 

The polynucleotides of SEQ Group 1, between Ihe start codon and tiie stop codon, encode 
respectively the polypq)tides of SEQ Groiq> 2. The nucleotide number of start codon and 
first nucleotide of the stop codon are Usted in table 3 for each polynucleotide of SEQ Qrovq) 
1. 

Table 3 

The respective SEQ ID NO for each Qrf is found in Table 1 . 



Name 


Start codon 


1*^ nucleotide of stop codon 


Orfl 


1 


2212 


Orf2 


1 


2476 
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Orf3 


1 


2404 


Orf4 


1 


2305 


Or£5 


1 


2188 


Orf6 


1 


2065 


OrfZ 


1 


2230 


OrfS 


1 


2269 


Or© 


1 


2107 


OrflO 


1 


2611 


Orfll 


1 


2281 


Orfl2 


1 


1888 


Orfl3 


1 


1732 


Orfl4 


1 


1435 


Orfl5 


1 


2731 


Orfl6 


1 


2746 


Orfl7 


1 


3031 


Orfl8 


1 


1942 


Orfl9 


1 


1255 


Orf20 


1 


2710 


OrfZl 


1 


1447 


Or£Z2 


1 


2278 


OrfiS 


1 


1546 


Orf24 


1 


1192 


Or£25 


1 


6901 


Orf26 


1 


2620 


OrfZ? 


1 


3118 


OrG8 


1 


2242 




Or£29 


1 


1576 




OrGO 


1 


1510 




Orf31 


1 


1492 




Orf32 


1 


1492 




Qr03 


1 


1381 




Orf34 




1348 




OrB5 




1288 




Orf36 




1144 




OrB7 




1096 




OrfiS 




898 




OrCP 




853 




Oif40 




847 




Orf41 
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Orf42 


1 


802 


Orf43 


1 


691 


Orf44 


1 


679 


Orf45 


1 


559 


Orf46 


1 


553 


Orf47 


1 


547 


Orf48 




502 


Orf49 




457 


OrfSO 




5308 


OrfSl 




580 


Orf52 




580 


Orf53 




2230 


Orf54 




1156 


Orf55 




1165 



Jn a furfher aspect, the present invention provides for an isolated polynucleotide 
conqjrising or consisting of: 

(a) a polynucleotide sequence which has at least 85% identity, preferably at least 90% 
identity, more preferably at least 95% identity, even more preferably at least 97, 98 or 
99% or exact identity to any sequence from SEQ Grotq) 1 over the entire length of the 
polunucleotide sequence from SEQ Gionp 1; or 

(b) a polynucleotide sequence aicoding a polypeptide which has at least 85% identity, 
preferably at least 90% identity, more preferably at least 95% identity, even more 
preferably at least 97. 98 or 99% or 100% exact, to any ammo add sequence selected 
from SEQ Gioiq) 2 , over the entire length of the amino acid sequence from SEQ Group 
2. 



A polynucleotide encoding a polypeptide of the present invention, including homologs and 
orlhologs from species other than B. pertussis, may be obtained by a process which 
comprises the steps of screening an appropriate library under stringent hybridization 
conditions (for example, using a temperature in the range of 45 - 65°C and an SDS 
concentration from 0.1 - 1%) wife a labeled or detectable probe consisting of or comprisinfi 
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any sequence selected from SEQ Group 1 or a fiagmeat thereof; and isolating a fuU-lengfh 
gene and/or ^omic clones containing said polynucleotide sequence. 

The invention provides a polynucleotide sequence identical over its entire length to a coding 
sequence (open reading frame) set out in SEQ Groi^ 1. Also provided by the invention is a 
coding sequence for a mature polypeptide or a fragment thereof, by itself as well as a coding 
sequence for a mature polypeptide or a fragment in reading frame with another coding 
sequence, such as a sequence encoding a leader or secretory sequence, a pre-, or pro- or 
prepro-protein sequence. The polynucleotide of the invention may also contain at least one 
non-coding sequence, including for exaarqple, but not limited to at least one non-coding 5' 
and 3' sequmce, such as the transcribed but non-translated sequences, termination signals 
(such as rho-dependent and rho-independent termination signals), ribosome binding sites, 
Kozak sequ^ces, sequences that stabilize mRNA, introns, and polyadenylation signals. 
The polynucleotide sequence may also conopiise additional coding sequence encoding 
additional amino acids. For exanople, a madsisr sequence tiiat &cilitates purification of the 
frised polypeptide can be encoded. In certain embodiments of the invention, the marker 
sequence is a hexa-histidine peptide, as provided in the pQE vector (Qiagen, Inc.) and 
described in Gentz et al, Proc. Natl AccuL Sci., USA 86: 821-824 (1989), or an HA peptide 
tag (Wilson et al. Cell 37: 161 (1984), both of which may be useful in purifying 
polypeptide sequence fused to them. Polynucleotides of the invention also include, but are 
not limited to, polynucleotides conq)rising a structural gene and its naturally associated 
sequences that control gene expression. 

The nucleotide sequences encoding the BASB232 polypeptides of SEQ Group 2 maybe 
identical to the corresponding polynucleotide encoding sequences of SEQ Group 1 . The 
position of the first and last nucleotides of the encoding sequences of SEQ Goup 1 are 
listed in table 4. Alternatively it may be any sequence, which as a result of the redundancy 
(degeneracy) of the genetic code, also encode polypeptides of SEQ Groiq> 2. 
Table 4 
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nucleotidic 


eacoded peptidic 


Start codon 


Last nucleotide 

ui cucuumg 
sequence 


SEQIDNO:! 


SEQIDNO:2 


1 


2211 


SEQIDNO:3 


SEQIDNO:4 


1 


2475 


SEQIDNO:5 


SEQIDNO:6 


1 


2403 


SEQIDNO:? 


SEQIDNO:8 


1 


2304 


SEQIDNO:9 


SEQ ID NO: 10 


1 


2187 


SEQIDNO: 11 


SEQ ID NO: 12 


1 


2064 


SEQIDNO: 13 


SEQ ID NO: 14 


1 


2229 


SEQIDNO:15 


SEQ ID NO: 16 


1 


2268 


SEQIDNO: 17 


SEQIDNO:18 


1 


2106 


SEQIDNO: 19 


SEQIDNO:20 


1 


2610 


SEQIDN0:21 


SEQIDNO:22 


1 


2280 


SEQIDNO:23 


SEQIDNO:24 


1 


1887 


SEQIDNO:25 


SEQ ID NO:26 


1 


1731 


SEQIDNO:27 


SEQIDNO:28 


1 


1434 


SEQIDNO:29 


SEQIDNO:30 


1 


2730 


SEQIDNO:31 


SEQIDNO:32 


1 


2745 


SEQnDNO:33 


SEQIDNO:34 


1 


3030 


SEQIDNO:35 


SEQIDNO:36 


1 


1941 


SEQIDNO:37 


SEQIDNO:38 


1 


1254 


SEQIDNO:39 


SEQ ID NO:40 


1 


2709 


SEQIDNO:41 


SEQIDNO:42 




1446 


SEQIDNO:43 


SEQIDNO:44 




2277 


SEQ ID NO:45 


SEQIDNO:46 




1545 


SEQIDNO:47 


SEQIDNO:48 




1191 


SEQ ID NO:49 


SEQ ID NO:50 




6900 
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SEQIDNO:51 


SEQIDNO:52 


1 


2619 


SEQIDNO:53 


SEQIDNO:54 


1 


3117 


SEQ ID NO:55 


SEQ ID NO: 56 


1 


2241 


SEQIDNO:57 


SEQIDNO:58 


1 


1575 


SEQIDNO:59 


SEQ ID NO:60 


1 


1509 


SEQIDNO:61 


SEQIDNO:62 


1 


1491 


SEQIDNO:63 


SEQIDNO:64 


1 


1491 


SEQIDNO:65 


SEQIDNO:66 


1 


1380 


SBQIDNO:67 


SEQIDNO:68 


1 


1347 


SEQ ID NO:69 


SEQIDNO:70 


I 


1287 


SEQ ID NO:71 


SEQIDNO:72 


1 


1143 


SEQ ID NO:73 


SEQIDNO:74 


1 


1095 


SEQIDNO:75 


SEQIDNO:76 


1 


897 


SEQIDNO:77 


SEQIDNO:78 


1 


852 


SEQIDNO:79 


SEQIDNO:80 


1 


846 


SEQIDNO:81 


SEQIDNO:82 


1 


813 


SEQIDNO:83 


SEQIDNO:84 


1 


801 


SEQIDNO:85 


SEQIDNO:86 


1 


690 


SEQIDNO:87 


SEQ ID NO:88 


1 


678 


SEQIDNO:89 


SEQ ID NO:90 


1 


558 


SEQIDNO:91 


SEQIDNO:92 


1 


552 


SEQ ID NO:93 


SEQIDNO:94 


1 


546 


SEQIDNO:95 


SEQIDNO:96 




501 


SEQIDNO:97 


SEQIDNO:98 




456 


SEQ ID NO: 99 


SEQ ID NO: 100 




5307 


SEQ ID NO: 101 


SEQ ID NO: 102 




579 


SEQ ID NO: 103 


SEQ ID NO: 104 




579 


SEQ ID NO: 105 


SEQ ID NO: 106 




2229 
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SEQIDNO: 107 


SEQIDNO: 108 


1 


1155 


SEQIDNO; 109 


SEQIDNO: 110 


1 


1164 



The term "polynucleotide encodmg a polypeptide" as used herein enconq)asses 
polynucleotides that include a sequence encoding a polypq>tide of the invention, particularly 
a bacterial polypeptide and more particularly a polypeptide of the B. pertussis or B 
parapertussis BASB232 having an amino acid sequence set out in any of tiie sequences of 
SEQ Group 2. The term also encompasses polynucleotides that include a sin^e continuous 
region or discontinuous regions encoding the polypeptide (for exan^jle, polynucleotides 
inteirupted by integrated phage, an integrated insertion sequaice, an integrated vector 
sequence, an integrated transposcm sequence, ot due to RNA editing or genomic DNA 
reorganization) to^er with additional r^ons, that also may contain coding and/or non- 
coding sequences. 

The invention furliier relates to variants of the polynucleotides described herein tinat encode 
variants of a polypeptides having a deduced amino acid sequence of any of tiie sequences of 
SEQ Qtoup 2 . Fragments of polynucleotides of the invention may be used, for example, to 
synthesize full-length polynucleotides of the invention. 

Preferred fragments are those polynucleotides which encode a B-cell or T-helper epitope, for 
example the fragments/peptides described in Example 8, and recombioant, chimeric genes 
conqjrising said polynucleotide fragments. 

Further particularly preferred embodiments are polynucleotides encoding BASB232 
variants, that have the amino add sequence of BASB232 polypeptides of any sequence from 
SEQ Groiq) 2 in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues 
are substituted, modified, deleted and/or added, in any combmation. Especially preferred 
a mong these are silent substitutions, additions and deletions, fliat do not alter the properties 
and activities of BASB232 polypq}tides. 
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Furflier prefetted embodiments of flic invention are polynucleotides that are at least 85% 
identical over fhar entire length to polynucleotides encoding BASB232 polypeptides having 
an amino acid sequence set out in any of the sequences of SEQ Group 2 , and 
polynucleotides fliat are conq>lementary to such polynucleotides. Alternatively, most highly 
preferred are polynucleotides tiiat conqarise a r^on that is at least 90% identical over its 
entire length to polynucleotides encoding BASB232 polypeptides and polynucleotides 
con5)lementary thereto. In this regard, polynucleotides at least 95% identical over their 
entire length to the same are particularly preferred. Furfhermore, those with at least 97% are 
highly preferred among those with at least 95%, and among these those with at least 98% 
and at least 99% are particularly highly preferred, witii at least 99% being die more 
preferred. 

Preferred embodiments are polynucleotides encoding polypeptides that retain substantially 
the same biological function or activity as mature polypeptides emcoded by a DMA 
sequences selected fiom SEQ Groiq) 1 . 

In accordance with certain preferred embodiments of this invention there are provided 
polynucleotides that hybridize, particularly under stringent conditions, to BASB232 
pofynucleotide sequences, such as those polynucleotides in SEQ Group 1. 

The invention further relates to polynucleotides that hybridize to the polynucleotide 
sequences provided herein. In this regard, the invention especially relates to polynucleotides 
that hybridize under stringent conditions to the polynucleotides described herein. As herein 
used, the terms "stiingent conditions" and "stringent hybridization conditions" mean 
hybridization occurring only if tiiere is at least 95% and preferably at least 97% identity 
between flie sequences. A specific example of stringent hybridization conditions is 
overnight incubation at 42''C in a solution comprising: 50% formamide, 5x SSC (150mM 
NaCl, 15mM tiisodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardf s 
solution, 10% dextian sulfate, and 20 micrograms/ml of denatined, sheared sahnon sperm 
DNA, followed by washing the hybridization siq)port in O.lx SSC at about 65°C. 
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Hybridization and wash conditions are well known and exen5)lified in Sambrook, et al^ 
Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N. Y., 
(1989), particularly Chapter 1 1 therein. Solution hybridization may also be used with the 
polynucleotide sequences provided by the invention. 

The mveution also provides a polynucleotide consisting of or comprising a polynucleotide 
sequence obtained by screening an appropriate hbrary containing the complete gene for a 
polynucleotide sequence set forth in any of the sequences of SEQ Group 1 under stringent 
hybridization conditions v^th a probe having the sequence of said polynucleotide 
sequence set forth in the correspondmg sequences of SEQ Qioxxp 1 or a fragment thereof; 
and isolating said polynucleotide sequence. Fragments useful for obtaining such a 
polynucleotide include, for example, probes and primers fully described elsewhere herein. 

As discussed elsewhere herein regarding polynucleotide assays of the invention, for 
instance, the polynucleotides of the invention, may be used as a hybridization piobe for 
RNA, cDNA and genomic DNA to isolate full-length cDNAs and genomic clones encoding 
BASB232 and to isolate cDNA and genomic clones of other genes that have a high identity, 
particularly high sequence identity, to the BASB232 genes. Such probes generally will 
comprise at least 15 nucleotide residues or base pairs. Preferably, such probes will have at 
least 30 nucleotide residues or base pairs and may have at least 50 nucleotide residues or 
base pairs. Particularly preferred probes will have at least 20 nucleotide residues or base 
pairs and will have less than 30 nucleotide residues or base pairs. 

A coding region of BASB232 genes may be isolated by screening using a DNA sequences 
provided m SEQ Group 1 to synthesize an oUgonucleotide probe. A labeled oligonucleotide 
having a sequence complementary to that of a gene of the invention is then used to screen a 
library of cDNA, genomic DNA or mRNA to detemiine which members of the library the 
probe hybridizes to. 
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There are several methods available and well known to those skilled in the art to obtain 
full-lengfh DNAs, or extend short DNAs, for exan5)le those based on the method of RBspid 
Amplification of cDNA ©ads (RACE) (see, for example, Frohman, et al, PNAS USA 85: 
8998-9002, 1988). Recent modifications of the technique, exenq>lified by the Marathon™ 
technology (Qontech Laboratories Inc.) for exaniple, have significantly sin5)lified the 
search for longer cDNAs. In the Marathon™ technology, cDNAs have been prepared 
from mRNA extracted from a chosen tissue and an 'adaptor' sequence ligated onto each 
end. Nucleic acid amplification (PCR) is then carried out to amplify the "rtrissing" 5' end 
of the DNA using a combination of gene specific and adaptor specific oligonucleotide 
primers. The PCR reaction is then repeated using "nested" primers, that is, primers 
designed to ameal within the amplified product (typically an adaptor specific primer that 
aimeals further 3' in the adaptor sequence and a gene specific primer that anneals finlher 5' 
in the selected gene sequence). The products of this reaction can then be analyzed by 
DNA sequencing and a fiall-length DNA constructed either by joining the product directly 
to the existing DNA to give a cotx^lete sequence, or carrying out a separate full-length 
PCR usmg the new sequence information for the design of the 5' primer. 

The polynucleotides and polypeptides of the invention may be employed, for example, as 
research reagents and materials for discovery of treatments of and diagnostics for diseases, 
particularly human diseases, as further discussed h^ein relating to polynucleotide assays. 

The polynucleotides of the invention that are oligonucleotides derived firom a sequence of 
SEQ Group 1 may be used in the processes herein as described, but preferably for PCR, to 
determine whether or not the polynucleotides idratified herein in whole or in part are 
transcribed in bacteria in infected tissue. It is recognized that such sequences will also 
have utility in diagnosis of the stage of infection and type of infection the pathogen has 
attained. 

Hie invmtion also provides polynucleotides that encode a polypeptide that is the mature 
protein plus additional amino or carboxyl-terminal amino acids, or amino acids interior to 
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the mature polypeptide (when the mature form has more than one polypeptide chain, for 
mstance). Such sequences may play a role in processing of a protein fiom precursor to a 
mature form, may allow protein transport, may lengthen or shorten protein half-life or may 
fecilitate manq)ulation of a protein for assay or production, among other things. As 
geuCTally is the case in vivo, the additional amino acids may be processed away fiom the 
mature protem by cellular enzymes. 

For each and every polynucleotide of the mvention there is provided a polynucleotide 
complementary to it It is prefOTed that these complementary polynucleotides are fully 
conq>lementary to each polynucleotide with which they are complementary. 

A precursor protein, having a mature form of the polypeptide fused to one or more 
prosequences may be an inactive form of the polypeptide. When prosequences are removed 
such inactive preciffsors generally are activated. Some or all of the prosequences may be 
removed before activation. Generally, such precursors are called proproteins. 

In addition to flie standard A, G, C, T/U representations for nucleotides, the term "N" may 
also be used in describing certain polynucleotides of the invention. "N" means that any of 
the four DNA or RNA nucleotides may appear at such a designated position in the DNA 
or RNA sequence, except it is preferred that N is not a nucleic acid that when taken in 
combination with adjacent nucleotide positions, when read in the correct reading frame, 
would have the effect of generating a premature termination codon in such reading frame. 

hrsum, a pK)lynucleotide of the invention may encode a mature protein, a mature protein 
plus a l e a d e r sequence (which may be referred to as a preprotein), a precursor of a mature 
protein having one or more prosequences that are not the leader sequences of a preprotein, 
era preproprotein, which is a precursor to a proprotein, having a leader sequence and one or 
more prosequences, v^ch generally are removed during processing steps that produce 
active and mature forms of the polypeptide. 
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Jn accordance with an aspect of flie invention, there is provided the use of a 
polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic innnunization. 

The use of a polynucleotide of the invention in genetic immunization will preferably 
employ a suitable delivery method such as direct injection of plasmid DNA into muscles 
(Wolff a/.. Hum Mol Genet (1992) 1: 363, Manthorpe et al. Hum. Gene Ther. (1983) 4: 
419), delivery of DNA complexed with specific protein carriers (Wu et al, J Biol Chem. 
(1989) 264: 16985), coprecipitation of DNA with calcium phosphate (Benvenisty & 
Reshef, PNAS USA, (1986) 83: 9551), encapsulation of DNA in various forms of 
Uposomes (Kaneda et aL, Science (1989) 243: 375), particle bombardment (Tang et aL, 
Nature (1992) 356:152, Bisenbraun et al, DNA Cell Biol (1993) 12: 791) and in vivo 
infection \ising cloned retroviral vectors (Seeger et al, PNAS USA (1984) 81: 5849). 

Vectors. Host Ceils, Expression Svstems 

The invention also relates to vectors that conqjrise a polynucleotide or polynucleotides of 
the inveation, host cells that are genetically engmeered with vectors of the invention and the 
production of polypeptides of the invention by reconibinant techniques. Cell-free translation 
systems can also be employed to produce such proteins using RNAs derived from the DNA 
constructs of the invention. 

Recombinant polypeptides of the present invention may be prepared by processes well 
Icnown in those skilled in the art from genetically engineered host cells coniprising 
expression systems. Accordmgly, in a ftirther aspect, tihie present invention relates to 
expression systems that conq)rise a polynucleotide or polynucleotides of the present 
invention, to host cells which are genetically engmeered with such expression systems, and 
to the production of polypeptides of the invention by recombinant techniques. 
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For recombinant production of the polypeptides of the invention, host cells can be 
genetically oigineered to incorporate expression systems or pOTtions thereof or 
polynucleotides of the invention. Introduction of a polynucleotide into the host ceU can be 
effected by mefliods described in many standard laboratory manuals, such as Davis, et al, 
BASIC METHODS IN MOLECULAR BIOLOGY, (1986) and Sambrook, et al, 
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. (1989), such as, calcium phosphate 
transfection, DEAE-dextran mediated transfection, transvection, microinjection, cationic 
lipid-mediatedtransfection, electroporation, conjugation, transduction, scrape loading, 
ballistic iatroduction and infection. 

Representative examples of ^ropriate hosts include bactraial cells, such as cells of 
stireptococci, staphjiococci, enterococci, E. colU stiieptomyces, cyanobacteria. Bacillus 
subtilis. Neisseria meningitidis. Haemophilus influenzae and MoraxeUa catarrhalis; fungal 
cells, such as cells of a yeast, Ktuveromyces, Saccharomyces, Pichia, a basidiomycete, 
Candida albicans taoA. Aspergillus; msect cells such as cells of DrosopMa S2 and 
Spodoptera Sf9; animal cells such as CHO, COS, HeLa, C127, 3T3, BHK, 293, CV-1 and 
Bowes melanoma cells; and plant cells, such as cells of a gymnosperm or angiosperm. 

A great variety of espression systems can be used to produce the polypqjtides of the 
invention. Such vectors include, among ofliers, chromosomal-, episomal- and virus-derived 
vectors, for exan5)le, vectors derived jBrom bacterial plasmids, from bacteriophage, from 
tiransposons, from yeast episomes, from insertion elements, from yeast chromosomal 
elements, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia 
viruses, adenoviruses, fowl pox viruses, pseudorabies viruses, picomaviruses, retrovirus^, 
and aiphaviruses and vectors derived from combinations fheteo]^ such as those derived fiom 
plasmid and bactadpphi^ graietic elements, such as cosmids and phagemids. The 
expression system constructs may contain contix>l regions that r^[ulate as well as engraider 
expression. Generally, any system or vector suitable to maintain, propagate or express 
polynucleotides and/orte ecpress a polypeptide in a host may be used for expression in this 
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regard The appropriate DNA sequence may be inserted into the exp^^ 

of a variety of weU-known and routine techniques, such as, for exan5)le, those set forth in 

Sambrook et aL, MOLECULAR CLONING, A LABORATORY MANUAL, {supra). 

In recombinant expression systems in eukaiyotes, for secretion of a translated protein into 
the lumen of (he endoplasmic reticulum, into the periplasmic space or into the extraceUular 
environment, appropriate secretion signals maybe incorporated into the expressed 
polypeptide. These signals may be endogenous to the polypeptide or they may be 
heterologous signals. 

Polypeptides of the present invention can be recovered and purified fiom recombinant 
cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, phosphocellidose 
chromatography, hydrophobic interaction chromatography, afBnity chromatography, 
hydroxyiqjatite chromatography and lectin chromatography. Most preferably, ion metal 
affinity chromatography (IMAC) is en^loyed for purification. Well known techniques 
for refolding protems may be employed to regenerate active conformation when the 
polypeptide is denatured during intracellular synthesis, isolation and or purification. 

The expression system may also be a recombinant live microorganism, such as a virus 
or bacterium The gene of interest can be inserted into the genome of a live recombinant 
virus or bacterium. Inoculation and in vivo infection with this live vector will lead to in 
vivo expression of the antig^ and induction of immune responses. Viruses and bacteria 
used for this purpose are for instance: poxviruses (e.g; vaccinia, fowlpox, canarypox), 
alphavinises (Sindbis virus, Srailiki Forest Virus, Venezueliau Equine Encephalitis 
Virus), adenoviruses, adeno-associated virus, picomaviruses (poliovirus, rhinovirus), 
herpesviruses (varicella zoster virus, etc). Listeria, Salmonella , Shigella, BCG, 
streptococci. These vuiises and bacteria can be vuulCTit, or attenuated in varioiis ways 
in order to obtain live vaccines. Such live vaccines also form part of the invention. 
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Combinations of Bordetell^ antigens in i n rnmnogenic cnmDositions 

A further aqiect of the invention discloses particular combinations of BordeteUa antigens 
which when combined, lead to an effective immunogenic composition against BordeteUa 
infection. The efficacy of the immunogenic composition is detennined as by its ability to 
elicit a protective response against B. pertussis primarily, but it is preferred that they also 
ehcit a protective effect against the related bacteria B. parapertussis and/or B. 
bronchiseptica. 



Prefenred combinations of BordeteUa antigens, when combined in an immunogenic 
composition or vaccine, aUow different BordeteUa functions to be targetted by the 
immune response. Such an immune response is better able to treat or prevent BordeteUa 
infection. For mstance, known virulence factors include adhesins like FHA, fimhtae, 
pertactin which are involved in attachment of BordeteUa to host cells; toxins such i 
pertussis toxin, adenjdate cyclase have a lole in disabling the host immune system; 
BrkA acts as a serum resistance factor and TcfA has a role in tracheal colonization. 



:as 



are 



In particular, combinations of certain antigens &am different classes, some of which 
involved in adhesion to host cells, some of which are involved in iron acquisition, some 
of which are antotransporters and some of which are toxins, can eKcit an immune 
response which protects against multiple functions of BordeteUa required to sustam 
infection. Such combinations of antigens can surprisingly lead to improved vaccine 
efficacy against BordeteUa infection where more that one function of the bacterium is 
targeted by the immune response. Preferably, the improved vaccine efficacy is against B. 
pertussis and/or B. parapertussis. 

Accordingly, the invention provides immunogenic conapositions conq)rismg at least or 
exactly two, three, preferably four, five, six. seven, eight, nine or ten different 
BordeteUa, preferably A pertussis antigens , wherein the antigens are selected fix>m at 
least two, three, four or five of the foUowing categories: 
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a) at least one Bordetella autotransporter protein selected from the groiq) consisting of 
a polypeptide sharing at least 70%, 80%; 90%, 95%, 97%, 98%, 99% or 100% 
identity with SEQ ID 34, 30, 32, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54 or 100, 
pertactin and BipA, or an antigenic or immunogenic fragment thereof, preferably a 
passenger domain thereof; 

b) at least one Bordetella iron acquisition protein selected from the group consisting of 
the polypeptide sharing at least 70%, 80%, 90%, 95%, 97%, 98%, 99% or 100% 
identity with SEQ ID 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28 or 106, or an 
antigenic or immunogenic fragment thereof; 

c) at least one Bordetella lipoprotein selected from the group consisting of the 
polypeptide sharing at least 70%, 80%, 90%, 95%, 97%, 98%, 99% or 100% 
identity with SEQ ID 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 
88, 90, 92, 94, 96, or 98 or an antigenic or inmiunogenic fragment thereof; 

d) at least one Bordetella adhesin selected from the group consisting of FHA, fimbriae, 
pertactin and BrkA or an antigenic or immunogenic fragment thereof; and 

e) at least one Bordetella toxin/invasin or antigens involved in toxiti/invasin secretion 
selected from the group consisting of pertussis toxin, adenylate cyclase, 
dermonecrotic toxin (Dnt), Type HI ss or Upopolysaccharide or an antigenic or 
immunogenic fragment thereof, 

wherein the Bordetella antigens in the immunogenic composition do not consist of any 
conabination of 2, 3, 4 or all 5 of pertactin, fimbriae 2, fimbrae 3, FHA and pertussis 
toxin. 

The combinations of the invention do not include whole cell pertussis (Pw), 

Immxmogenic compositions of the invention therefore do not cover known vaccine 
combinations (for instance vaccines where the acellular pertussis component consist of 
2, 3, 4 or 5 of FHA, pertussis toxoid, pertactin, fimbrae 2 and fimbrae 3) however a 
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single known antigen from one groiqp, combined witihi a new antigen from a different 
group is covered. 

The Bordetella antigens may derived from any strain of Bordetella including from one 
or more of J?, pertussis, B, parapertussis and B. bronchiseptica Q)referably the former). 

Preferably all five groiq)s of antigen are represented in the immunogenic composition of 
the invention. Where an antigen falls into two groups, the inclusion of that one antigen 
into an immimogenic composition leads to the inclusion of both groiq>s in the 
inomxmogenic composition. 

Where a protein is specifically mentioned herein, it is preferably a reference to a native, 
full-length protein but it may also encompass antigenic, preferably immunogenic 
fragments thereof particularly in the context of subunit vaccines). These are fiagments 
containing or comprising at least 10 amino acids, preferably at least 20 amino acids, 
more preferably at least 30 amino acids, more prefwably at least 40 amino acids or most 
preferably at least 50 amino acids, taken contiguously from the amino acid sequence of 
the protein wherein the fragment is shorter than the fiill length of the protein. 
Paticularly preferred firagments are the passenger domains of autotransporter proteins as 
defined above. In addition, antigenic firagmrats denotes fragments that are 
immunologically reactive with antibodies generated against the B. pertussis proteins or 
with antibodies generated by infection of a mammalian host with Bordetella. Antigenic 
firagments also includes fragments that when administered at an effective dose, elicit a 
protective immune response against Bordetella infection, more preferably it is protective 
against B, pertussis and/or B. parapertussis and/or B, bronchiseptica infection. 
Preferably such fragments are coiipled to a source of T - cell epitopes. 

Also included in the invention are recombinant fusion proteins of Bordetella proteins of 
the-invmtion, or firagments thereof. These may combine different Bordetella proteins or 
firagments thereof in the same pol ypeptide . A lternatively, the invention also includes 
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individual fusion proteins of Bordetella proteins or firagments thereof, as a fusion 
protein with het^ologous sequences such as a provider of T-cell epitopes or purification 
tags, for exan^jle: P-galactosidase, glutathione-S-transferase, green fluorescent proteins 
(GFP), epitope tags such as FLAG, myc tag, poly histidine, or viral surface proteins 
such as influenza virus haemagglutinin, tetanus toxoid, diphttieria toxoid, CRM197. 

Antigens of tbe invention 

1 . Autotransporter proteins 

Autotransporter proteins typically are made iq) of a signal sequence, a passenger domain 
and an anchoring domain for attachmaot to the outer membrane. Examples of 
autotransporter proteins include pertactin (SEQ ID 30), VagS (SEQ ID 32), BrkA (SEQ 
ID 34), TcfA (SEQ ID 36) (Finn and Stevens (1995) Mol. Microbiol. 16; 625-634), Phg 
(SEQ ID 38), BipA (Stockbauer et al 2001; Molecular Microbiology 39; 65-78), BapA 
(SEQ ID 40), BapB (SEQ ID 42), B^C (SEQ ID 44), pertactin-like protein (SEQ ID 
46), Tcf-like protein (SEQ ID 48), extracellular serine protease (SEQ ID 50, SEQ ID 
100). Y^E (SEQ ID 52), SphBl (SEQ ID 54). Hiese antigens maybe derived ftom 
Bordetella pertussis or Bordetella parapertussis or other Bordetella strains. 

It is particularly advantageous to use the passrager domain of an autotranporter when it 
is included in a subunit vaccine. Table 2 above defines the passsenger domains of the 
autotransporter proteins listed above. 

BipA contams 90 amino acid tandem rqpeats with 5 being present in flie B. pertussis 
protein and 8 being present in the B. bronchiseptica protein. These repeats span from 
amino acid 581 to 1030 in B. pertussis and amino acids 581 to 1300 in B. 
bronchiseptica. Preferred fragments of Bi^A include amino acids 1031 to 1308, amino 
acids 941 to 1308, amino acids 851 to 1308, amino acids 761 to 1308, amino acids 671 
to 1308 and 581 to 1308 of the B. pertussis sequence (or sequences related to these tihat 
have 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40 or 50 amino acids added or deleted from 
eifiher or both of the N and C tCTmini). 
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2. Iron acquisition proteins 

Iron acquisition is of great importance to mammalian pathogens as iron is present in 
limiting conditions in the host and any iron that is present is sequestered by haem and 
other iron-chelating conq>oxmds. Iron from siderophores and host iron-binding 
con^lexes is internalised tiirough TonB-dependent outer membrane ferric complex 
receptors. Bordetella iron aquisition proteins include BfeA (SEQ ID 2), BfrB (SEQ 
ID 4), BfrC (SEQ ID 6), FauA (SEQ ID 8), ferric siderophore receptor (SEQ ID 10), 
Ferric alcaligin siderophore receptor (SEQ ID 12), iron tranport protein fiu (SEQ ID 14, 
SEQ ID 106), iron tranport protein fiu (SEQ ID 16), putative hydrxamate-type 
ferrisiderophore receptor signal peptide protein (SEQ ID 18) BhuR (SEQ ID 20) 
(Infection and Immunity 2001, 69; 6951), tonb-dependent receptor (SEQ ID 22), tonb- 
dependent receptor (SEQ ID 24), ferrisiderophore receptor-like protein (SEQ ID 26) and 
tonb-dependent receptor Yncd precurser (SEQ ID 28). These proteins may be derived 
from Bordetella pertussiss, Bordetella parapertussis or other Bordetella strains, 
preferably the former. 

3. Lipoproteins 

Bordetella lipoproteins include heme/hemopexin utilisation protein C presxirsor (SEQ 
ID 56), pifai protein (SEQ ID 58), immunogenic protein (SEQ ID 60), outer membrane 
lipoprotein precursor (SEQ ID 62), outer membrane efflux protein precursor (SEQ ID 
64), Oprm (SEQ ID 66), outer membrane channel signal protein (SEQ ID 68), MltA 
(SEQ ID 70), MltB (SEQ ID 72), yccz precursor (SEQ ID 74), serine protease 
transmembrane protein (SEQ ID 76), pa4632 (SEQ ID 78), coml precursor (SEQ ID 
80), VacJ (SEQ ID 82), outer membrane lipoportein (SEQ ID 84), Flagelar l-ring 
protdn (SEQ ID 86), Ydcl (SEQ ID 88), Pal (SEQ ID 90), OmlA (SEQ ID 92), 
Smc00354 (SEQ ID 94), Pep (SEQ ID 96) and Upoprotein (SEQ ID 98). 
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The lipoproteins having the sequence of SEQ ID 56-96 contain a lipidation motif 
indicating that they would be lipidated and inserted into the membrane. In its sin5)lest 
form, the lipidation motif contains the concensus sequence LXXC. However, the 
concensus sequence is preferably close to the amino terminus of the sequence, within 
the larger concensus sequence:- 

<(M, V, {1, 40}'KD, E, R, K,*)6(L, I, V, M, F, W, S, T, A, G)2(L, I, V, M, F, Y, S, 
T,A,G,C,Q)(A,G,S)C 

< indicates the amino termiaus of the protein so that the first amino acid should be M, V 
or L. {1, 40} indicates that between 1 and 40 amino acids should be present between the 
first amino acid and the rest of the concensus sequence. (D, E, R, K, *)6 indicates that 
the next 6 amino acids should not be D, E, R or K. The following 2 amino acids should 
be one of the aliphatic amino acids indicated and is preferably L. For the following 2 
amino acids, the amino acids shown in flie parentheses should be present and the final 
amino acid of the sequence shold be C. These antigens may be derived fix>m Bordetella 
pertussis or Bordetella parapertussis, Bordetella bronckoseptica or other Bordetella 
strains, inreferably the former. 

4, Adhesins 

Adhesins have a role in attaching Bordetella to a host cell and hence have important 
roles in virulence. They include filamentous haemagglutinin (FHA) (Relman et al 
(1989) Proc Natl Acad Sci USA 86; 2634-2641), fimbriae (Fim) (Mooi et al (1992) 
Microb Pathog 12; 127-135), pertactin (Roberts et al (1991) Mol Microbiol 5; 1393- 
1404) and BrkA ( Fernandez et al (1994) Infection and Immimity 62; 4727-4738). 
These antigens may be derived fi*om Bordetella pertussis or Bordetella parapertussis, 
Bordetella bronckoseptica or other Bordetella strains, preferably the former. 

Fimbriae or Fim proteins are also known as aggutinins or fimbrial adhesins. Hie term 
Fim comprises fimbriae 2 and fimbriae 3. 
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5. Toxins 

Toxins include adenylate cyclase (CyaA) (Hewlett et al (1989) J. BioLChem. 264; 
19379-19384), pertussis toxin (Mimoz et al (1981) Infect Tmmun 33; 820-826), dermo- 
necrotic toxin (Dnt) (Livey (1984) J. Med. Microbiol. 17; 91-103 and 
lipopolj^accharides. Toxins also include proteins that are involved in the secretion of 
toxins since an immune response against the secretory mechanism would prevent the 
efficient functioning of the secretory mechanism and lead to reduced toxin secretion. An 
example of such an antigen is the Type m secretion system (Yuk et al (2000) Mol. 
Microbiol. 35; 991-1004). These antigens maybe derived from Bordetella pertussis or 
Bordetella parapertussis, Bordetella bronchoseptica or other Bordetella strains, 
preferably the former. 

Preferred fragments of adenylate cyclase comprise amino acids 385-399 or sequ^ces 
related to this that have 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40 or 50 amino acids added to 
either or both of the N and C termini. Preferred fragments are disclosed in EP4245 1 8, 
EP787796 or WO 90/13312. 

Where toxin is described herein, non-toxic derivatives such as toxoids or mutant toxins 
are also envisaged to be covered by the term. 

Preferred combinations 

In any of the preferred combinations listed below, the term antigen corcqprises 
inmiunogenic fragments of that antigen. 

hi a preferred embodiment of the combination of antigens of the invention, the 
immunogenic composition comprises at least one Bordetella iron acquisition protein 
selected from the groiq) consisting of the polypeptide sharing at least 70%, 80%, 90%, 
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95%, 97%, 98%, 99% or 100% amino acid identity with SEQ ID 2, 4, 6, 8, 10, 12, 14, 
16, 18, 20, 22, 24, 26, 28, 106 or an antigenic or inamunogenic fragment thereof. 

In a particularly preferred embodiment of the invention, the immunogenic composition 
coni^rises one, two or ttiree of FHA, pertussis toxin and pertactin, (preferably FBLA. and 
PT; FHA and pertactin; PT and pertactin; or EEIA, pertussis toxin and pertactin) and 
further comi>rises at least one Bordetella iron acquisition protein selected from the 
group consisting of the polypeptide sharing at least 70%, 80%, 90%, 95%, 97%, 98%, 
99% or 100% amino acid identity with SEQ ID 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28,106 or an antigenic or inamunogenic fragment thereof. 

In a further preferred embodiment of the invention, the immunogenic coiiq)Osition 
comprises at least one Bordetella autotransporter protein selected from the group 
consisting of BipA, the polypq)tide sharing at least 70%, 80%, 90%, 95%, 97%, 98%, 
99% or 100% amino acid identity with SEQ ID 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 
54,100, or an antigenic or immunogenic fragment thereof, preferably the passenger 
domain thereof. 

In a particularly preferred embodiment of Ihe invention, the immunogenic composition 
comprises one, two or three of FHA, pertussis toxin and pertactin, (preferably FHA and 
PT; FHA and pertactin; PT and pertactin; or FHA, pertussis toxin and pertactin), and 
fijTther comprises at least one Bordetella autotransporter protein selected from the 
group consisting of BipA, the polypeptide sharing at least 70%, 80%, 90%, 95%, 97%, 
98%, 99% or 100% amino acid identity with SEQ ID 32, 34, 36, 38, 42, 46, 48, 50, 52, 
54, 100, or an antigenic or immunogenic fragment thereof, preferably the passenger 
domain thereof. 

In a frirther preferred embodiment of the invention, the immunogenic conqsosition will 
conq)rise at least one Bordetella lipoprotein selected from the group consisting of the 
polypeptide sharing at least 70%, 80%, 90%, 95%, 97%, 98%, 99% amino acid identity 
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With SEQ ID 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 
96, 98 or an antigenic or immunogenic £raginent thereof, 

hi a particularly preferred embodiment of the mvention, the uimiunogenic composition 
con5>rises one, two or three of FHA, pertussis toxin andpertactin, (preferably FHA and 
FT; FHA and pertactin; FT and pertactin; or FHA, pertussis toxin and pertactin) at least 
one Bordetella lipoprotein selected from the group consisting of the polypeptide sharing 
at least 70%, 80%, 90%, 95%, 97%, 98%, 99% amino acid identity with SEQ ID 56, 58, 
60, 62, 64, 66, 68, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 94, 96, 98 or an antigenic or 
immunogenic fragment thereof. 

hi a further preferred embodiment of the invention, the immunogenic composition 
conq)rises BrkA or an antigenic or unmunogenic fragmmt thereof. 

Iq a further preferred embodiment of the invention, the immunogenic conQK>sition 
conaprises at least one Bordetella toxin or antigens involved m toxin secretion selected 
from the group consisting of adenylate cyclase, dermonecrotic toxin (Dnt), Type III ss or 
lipopolysaccharide or an antigenic or immunogenic fragment thereof. 

The combination of FHA, pertussis toxin and pertactin is known to elicit a protective 
immune reponse and particularly preferred combinations of the invention will contain 
one, two or three of these constituents, optionally further comprising fim 2, fim 3 or fim 
2 and fim 3. 

A- pre f e rred immunogenic composition of the invention contains TcfA and at least 1, 2, 
3, 4, antigens selected from the list consisting of pertussis toxin, adenylate cyclase, LPS, 
BipA, Type IIl3S,BhuR, FHA, Fim, pertactin, BikA, MhA, Klltfi, VacJ, QmlA and 
Pep. Preferred combinations conqnise TcfA and pertussis toxin, (optionally with 1, 2, 3, 
4-or 5 of BipA,BhuR, FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, QmlA and Pep); 
TcfA-and adenylate^cyclase (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, 
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pertactin, BikA, MltA, MltB, VacJ, OmIA and Pep); TcfA and LPS (optionaUy with 1, 
2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, OmlA and 
Pq>); TcfA and BipA (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate 
cyclase, LPS, Type Hiss, BhuR, FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, OmlA 
and Pep); TcfA and Type Ilfas (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, BipA, 
BhuR , FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, OmlA andPcp); TcfA and BhuR 
(optionally witii 1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, Type 
mss, FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, OmlA, PCP); TcfA and FHA 
(optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, 
pertactin, BhuR, Type UIss, MltA, MltB, VacJ, OmlA, Pep); TcfA and a lipoprotem 
selected from MltA, MltB, VacJ, OmlA and Pep (optionally wifli 1, 2, 3, 4 or 5 of 
pertussis toxin, adenylate cyclase, LPS, BipA, BhuR, Type Ilfes, Fun, pertactin, BrkA 
and FHA). 

A further preferred immunogenic con^sition of the invention contains BipA and at 
least 1, 2, 3, 4, antigens selected ftom the list consisting of pertussis toxin, adenylate 
cyclase, LPS, Type JE ss, BhuR, FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, OmlA 
and Pep. Preferred combinations con^nise BipA and pertussis toxin, (optionally with 1, 
2, 3, 4 or 5 of BhuR, FHA, pertactin, Fim, BrkA, MltA, MltB, VacJ, OmlA and Pep), 
BipA and adenylate cyclase (optionally with 1, 2, 3, 4 or 5 of BhuR, FHA, pertactin, 
Fim, BrkA, MltA, MltB, VacJ, OmlA and Pep); BipA and LPS (optionally with 1, 2, 3, 

4 or 5 of BhuR, FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, OmlA and Pep); BipA 
and Type UIss (optionally wilh 1, 2, 3, 4 or 5 of BhuR , FHA, Fim, pertactin, BrkA, 
pertussis toxin, MltA, MltB, VacJ, OmlA and Pep); BipA and BhuR (optionally with 1, 
2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, Type Diss, FHA, Fun, pertactin, 
BrkA, MltA, MltB, VacJ, OmlA and PCP); BipA and FHA (optionally with 1, 2, 3, 4 or 

5 of pertussis toxin, adenylate cyclase, LPS, pertactin, BhuR, Type Hiss, MltA, MltB, 
VacJ, OmlA and PCP); BipA and a Upopiotein selected from MltA, MltB, VacJ, OmlA 
and Pep (optionally witit 1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, 
BhuR, Type Hiss, Fim, pertactin, BrkA and FHA). 
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A furfher preferred umnunogenic coooposition of fte invention contains BapA and at 
least 1, 2, 3, 4, antigens selected from the list consisting of pertussis toxin, adenylate 
cyclase, LPS, BipA, Type m ss, BhuR, FHA, Fim, pertactin, BrkA , NfltA, MltB, VacJ, 
OmlA and Pep. Preferred combinations comprise BapA and pertussis toxin, (optionally 
with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, 
OmlA and Pep); BapA and adenjdate cyclase (optionally with 1, 2, 3, 4 or 5 of BipA, 
BhuR, FHA, Fim, pertactin, BrkA, pertussis toxin, MltA, MltB, VacJ, OmlA and Pep); 
BapA and LPS (optionaUy with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, 
BrkA, MltA MltB, VacJ, OmlA and Pep); BapA and BipA (optionally with 1, 2, 3, 4 
or 5 of pertussis toxin, adenylate cyclase, LPS, Type Hiss, BhuR, FHA, Fim, pertactin, 
BrkA, IVfltA, MltB, VacJ, OmlA and Pep); B^A and Type lEbs (optionaUy with 1, 2, 
3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, BhuR , FHA, Fim, pertactin, 
BikA, MltA, Mltfi, VacJ, OmlA and Pep); BapA and BhuR (optionally with 1, 2, 3, 4 
or 5 of pertussis toxm, adenylate cyclase, LPS, BipA, T^e Hiss, FHA, Fim, pertactin, 
BrkA, MltA, MltB, VacJ, OmlA and Pep); B^A and FHA (optionaUy with 1, 2, 3, 4 or 
5 of pertussis toxin, adenylate cyclase, LPS, BipA, BhuR, Fim, pertactin, BrkA, Type 
nfes, MltA, MltB, VacJ, OmlA and Pep); BapA and a lipoprotein selected from MltA, 
MltB, VacJ, OmlA and Pep (optionaUy with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate 
cyclase, LPS, BipA, Fim, pertactin, BikA, BhuR, Type Hiss and FHA), provided that 
the combination is not BapA and PT and FHA. 

A fiiriha- preferred immunogenic composition of the invention contains BapB and at 
least 1, 2, 3, 4, antigens selected from the list consisting of pertussis toxin, adenylate 
cyclase, LPS, BipA, Type HI ss, BhuR, FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, 
OmlA and Pep. Preferred combinations comprise BapB and pertussis toxin, (optionaUy 
wife 1, 2, 3, 4 or 5 of BipA, BhuR. FHA, Fim, pertactin, BrkA, MltA, MKB, VacJ, 
OmlA and Pep); BapB and adenylate cyclase (optionaUy with 1, 2, 3, 4 or 5 of BipA, 
BhuR, FHA, Fim, pertactin, BikA, MltA MltB, VacJ, OmlA and Pep); Bq>B and LPS 
(optionaUy with 1, 2, 3, 4 or 5 of BipA, msvR, FHA, Fim, pertactin, BrkA, MltA, 
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MltB, VacJ, QmlA and Pep); BapB and BipA (optionaUy with 1, 2, 3, 4 or 5 of 
pertussis toxin, adenylate cyclase, LPS, Type Diss, BhuR, FHA, Fim, pertactin, BrkA, 
MltA, MltB, VacJ, QmlA and Pep); BapB and Type Hiss (optionaUy with 1, 2, 3, 4 or 5 
of pertussis toxio, BipA, BhuR , FHA, Fim, pertactin, BikA, MltA, MltB, VacJ, QmlA 
and Pep); BapB and BhuR (optionally wilh 1, 2, 3, 4 or 5 of pertussis toxin, adenylate 
cyclase, LPS, BipA, Type Diss, FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, OmlA 
and Pep); BapB and FHA (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate 
cyclase, LPS, BipA, Fim, pertactin, BrkA, BhuR, Type Hiss, MltA, MltB, VacJ, QmlA 
and Pep); BapB and a lipoprotein selected from MltA, MltB, VacJ, OmlA and Pep 
(optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adraylate cyclase, LPS, BipA, BhuR, 
Type mss, Fim, pertactin, BrkA and FHA). 

A further preferred immunogenic composition of the invention contains BapC and at 
least 1, 2, 3, 4, antigens selected from the list consisting of pertussis toxin, adenylate 
cyclase, LPS, BipA, Type HI ss, BhuR, FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, 
OmlA and PCP- Preferred combinations comprise BapC and pertussis toxin, (optionally 
with 1,2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, 
QmlA and Pep); BapC and adenylate cyclase (optionally with 1, 2, 3, 4 or 5 of BipA, 
BhuR, FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, QmlA and Pep); BapC and LPS 
(optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA, MltA, 
MltB, VacJ, QmlA and Pep); BapC and BipA (optionaUy with 1, 2, 3, 4 or 5 of 
pertussis toxin, adenylate cyclase, LPS, Type Hiss, BhuR, FHA, Fim, pertactin, BrkA, 
MltA, MltB, VacJ, QmlA and Pep); BapC and Type mss (optionally with 1, 2, 3, 4 or 5 
of pertussis toxin, BipA, BhuR , FHA, Fim, pertactin, BrkA, MltA, MltB, VacJ, QmlA 
and Pep); BapC and BhuR (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate 
cyclase, LPS, BipA, Type Diss, FHA, Fim, p^tactin, BrkA, MltA, MltB, VacJ, OmlA 
and Pep); BapC and FHA (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate 
cyclase, LPS, BipA, BhuR, Type Hiss, pertactin, MltA, Mlffl, VacJ, QmlA and Pep); 
BapC and a lipoprotein selected from MltA, MltiB, VacJ, QmlA and PCP (optionally 
with 1, 2, 3, 4 or 5 of pertussis luxin, adenylate cyclase, LPS, BipA, BhuR, Type IBfes, 
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Fim, pertactm, BrkA and FHA), provided that the combination is not B^C and PT and 
FHA. 

A fiirther prefeired immunogenic composition of the invCTition contains pertactin and at 
least 1, 2, 3, 4, antigCTs selected from the list consisting of pertussis toxxa, adenylate 
cyclase, LPS, BipA, Type III ss, BhuR, FHA, Fim, BrkA, MltA, MltB, VacJ, OmlA and 
PCP. Preferred combinations comprise pertactin and pertussis toxin, (optionally with 1, 

2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, BrkA, MltA, MltB, VacJ, OmlA and Pep); 
pertactin and adenylate cyclase (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, 
Fim, BrkA, MltA, MltB, VacJ, OmlA and Pep), pertactin and LPS (optionally with 1, 2, 

3, 4 or 5 of BipA, BhuR, FHA, Fim, BrkA, MltA, MltB, VacJ, OmlA and Pep); 
pertactin and BipA (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, 
LPS, Type Hiss, BhuR, FHA, Fim, BrkA, MltA, MltB, VacJ, OmlA and Pep); pertactin 
and Type Hiss (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, BipA, BhuR , FHA, 
Fim, BrkA, MltA, MltB, VacJ, OixilA and Pep); pertactin and BhuR (optionally with 1 , 
2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, Type Hiss, FHA, Fina, 
BrkA, MltA, MltB, VacJ, OmlA and Pep); pertactin and FHA ( optionally with 1, 2, 3, 
4 or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, BhuR, Type Hiss, MltA, MltB, 
VacJ, OmlA and Pep), pertactin and a lipoprotein selected from MltA, MltB, VacJ, 
OmlA and PCP (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, 
LPS, BipA, BhuR, Type Hiss, Fim, BrkA and FHA). 

A further preferred immunogenic composition of the invention contains pertactin-like 
protein and at least 1, 2, 3, 4, antigens selected from the list consisting of pertussis 
toxin, adenylate cyclase, LPS, BipA, Type HI ss, BhuR, FHA, pertactin, Fim, BrkA, 
MltA, MltB, VacJ, OmlA and Pep. Prefeired combinations comprise pertactin-like 
protein and pertussis toxin, (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, 
pertactin, Fim, BrkA, MltA, MltB, VacJ, OmlA and Pep); pertactin-like protein and 
adenylate cyclase (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, pertactin, Fim, 
BrkA, MltA, MlfB. VacJ, OmlA and Pep); pertactin-like jxrotein and LPS (optionally 
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withl,2, 3,4or5of BipA,BhuR,FHA,pertactin,Fim,BrkA^ MltA, MltB, VacJ, 
OmlA and PCP); pertactin-like protein and BipA (optionally with 1, 2, 3, 4 or 5 of 
pertussis toxin, adenylate cyclase, LPS, Type Hiss, BhuR, FHA, pertactin, Fim, BrkA, 
MltA, MtB, VacJ, OmlA and Pep); pertactin-like protein and Type Diss (optionally 
with 1, 2, 3, 4 or 5 of pertussis toxin, BipA, BhuR , FHA, pertactin, Fina, BrkA, MltA, 
Mlffl, VacJ, QmlA and Pep); pertactin-like protein and BhuR (optionally with 1, 2, 3, 4 
or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, Type Hiss, FHA, pertactin, Fim, 
BrkA, MltA, MltB, VacJ, OmlA, Pep); pertactin-like protein and FHA (optionally with 
1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, BhuR, pertactin, Fim, 
BrkA, Type Hiss, MltA, MltB, VacJ, OmlA, Pep); pertactin-like protein and a 
Upoprotein selected from MltA, MltB, VacJ, OmlA and Pep (optionally with 1, 2, 3, 4 
or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, BhuR, Type Hiss, pertactin, Fim, 
BrkA and FHA). 

A further preferred immunogenic composition of the invention contains YapE and at 
least 1, 2, 3, 4, antigens selected from the list consisting of pertussis toxin, adenylate 
cyclase, LPS, BipA, Type III ss, BhuR, FHA, pertactin, Fim, BrkA, MltA, MltB, VacJ, 
OmlA, Pep. Preferred combinations comprise YapE and pertussis toxin, (optionally 
wifli 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, pertactin, Fim, BrkA, MltA, MltB, VacJ, 
QmlA, Pep); YapE and adenylate cyclase (optionally with 1, 2, 3, 4 or 5 of BipA, 
BhuR, FHA, pertactin, Fim, BrkA, MltA, MltB, VacJ, OmlA, Pep); YapE and LPS 
(optionally witti 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, pertactin, Fim, BrkA, MltA, 
MltB, VacJ, OmlA, Pep); YapE and BipA (optionally with 1, 2, 3, 4 or 5 of pertussis 
toxin, adenylate cyclase, LPS, Type UIss, BhuR, FHA, pertactin, Fim, BrkA, MltA, 
MltB, VacJ, OmlA, Pep); Y^E and Type Diss (optionally with 1, 2, 3, 4 or 5 of 
pertussis toxin, BipA, BhuR , FHA, pertactin, Fim, BrkA, MltA, MltB, VacJ, QmlA, 
Pep); YapE and BhuR (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate 
cyclase, LPS, BipA, Type Diss, FHA, pertactin, Fim, BrkA, MltA, MltB, VacJ, QmlA, 
Pep); YapE and FHA (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, aden>date 
cyclase, LPS, BipA, BhuR, Type Hiss, pertactin, MltA, MltB, VacJ, QmlA, Pep); YapE 
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and a lipoprotein selected from MltA, MltB, VacJ, OmlA, Pep (optionally with 1, 2, 3, 
4 or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, BhuR, Type Hiss, pertactin, 
Fim, BrkA and FHA), 

A fut&er prefixed immunogenic composition of tiie invention contains BrkA and at 
least 1, 2, 3, 4, antigens selected from the list consisting of pertussis toxin, adenylate 
cyclase, LPS, BipA, Type m ss, BhuR, FHA, pertactm, Fim, MltA, MltB, VacJ, OmlA, 
Pep. PrefCTred combinations comprise BrkA and pertussis toxin, (optionally with 1, 2, 3, 
4 or 5 of BipA, BhuR, FHA, pertactin, Fim, MltA, MltB, VacJ, OmlA, Pep); BrkA and 
adenylate cyclase (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, pertactin, Fim, 
MltA, MltB, VacJ, OmlA, Pep); BrkA and LPS (optionally with 1, 2, 3, 4 or 5 of BipA, 
BhuR, FHA, pertactin, Fim, MltA, MltB, VacJ, OmlA and Pep); BrkA and BipA 
(optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, Type Diss, 
BhuR, FHA, pertactin, Fim^ MltA, MltB, VacJ, OmlA, Pep), BrkA and Type Hiss 
(optionally with 1, 2, 3, 4 or 5 of pertussis toxin, BipA, BhuR , FHA, pertactin, Fim, 
MltA, MltB, VacJ, OmlA, Pep); BrkA and BhuR (optionally witii 1, 2, 3, 4 or 5 of 
pertussis toxin, adenylate cyclase, LPS, BipA, Type lUss, FHA, pertactin, Fina, MltA, 
MltB, VacJ, OmlA, Pep); BrkA and FHA (optionally with 1, 2, 3, 4 or 5 of pertussis 
toxin, adenylate cyclase, LPS, BipA, BhuR, Type Hiss, pertactin, MltA, MltB, VacJ, 
OmlA, Pep); BrkA and a lipoprotein selectd from MltA, MltB, VacJ, OmlA and Pep 
(optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, BhuR, 
Type niss, pertactin, Fim and FHA). 

A fruHier preferred immunogenic coniposition of the invention contains FHA and at 
leastl, 2, 3, 4, antigens selected from the list consisting of pertussis toxin, adenylate 
cyclase, LPS, BipA, Type m ss, BhuR, BrkA pertactin, Fim, MltA, MltB, VacJ, OmlA 
and Pep. Preferred combinations comprise FHA and pertussis toxin, (optionally with 1, 
2,3,4or5of BipA, BhuR, BrkA, pertactin, Fim , MltA, Mlffl, VacJ, OmlA and Pep); 
FHA and adenjdate cyclase (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, BikA, 
pertactin, Fim , MltA, MltB, VacJ, OmlA and Pep); FHA and LPS (optionally with 1, 
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2, 3, 4 or 5 of BipA» BhuR, BrkA, pertactin, Fim , MltA, MltB, VacJ» QmlA and Pep); 
FHA and BipA (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, 
LPS, Type Hiss, BhnR, BrkA, pertactin, Fim , MltA, MltB, VacJ, QmlA and Pep); FHA 
and Type Hiss (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, BipA, BhuR, BrkA, 
pertactin, Fim, MltA, MltB, VacJ, OmlA and Pep); FHA and BhuR (optionally with 1, 
2y 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, Type Hiss, BrkA, 
pertactin, Fim , MltA, MltB, VacJ, QmlA and Pep); FHA and a lipoproteiQ selected 
from MltA, MltB, VacJ, QmlA and Pep (optionally with 1, 2, 3, 4 or 5 of pertussis 
toxin, adenylate cyclase, LPS, BipA, BhuR, Type Hiss, pertactin, Fim and BrkA), 

A further preferred immunogenic composition of the invention contains BhuR and at 
least 1, 2, 3, 4, antigens selected from the list consisting of pertussis toxin, adenylate 
cyclase, LPS, Tcf^ BapA, BapB, BapC, pertactin, pertaetin-like protein, YapE, BrkA, 
BipA, Type mss, FHA, Fim, MltA, MltB, VacJ, QmlA and Pep. Preferred 
combinations comprise BhuR and pertussis toxin (optionally with 1, 2, 3, 4 or 5 of 
TcfA^ BapA, BapB, BapC, pertactin, pertactin-like protein, YapE, BrkA, BipA, FHA, 
Fim , MltA, MltB, VacJ, QmlA and Pep); BhuR and adenylate cyclase (optionally with 
1, 2, 3, 4 or 5 of TcfA, BapA, BapB, BBopC^ pertactin, pertactin-like protein, YapE, 
BrkA, BipA, pertussis toxin, FHA, Fim, BrkA , MltA, MltB, VacJ, QmlA and Pep); 
BhuR and LPS (optionally with 1, 2, 3, 4 or 5 of TcfA., BapA, BapB, BapC, pertactin, 
pertactin-like protein, YapE, BrkA, BipA, pertussis toxin, FHA, Fim, BrkA , MltA, 
MltB, VacJ, QmlA and Pep); BhuR and TcfA. (optionally with 1, 2, 3, 4 or 5 of 
pertussis toxin, adenylate cyclase, LPS, BipA, Type mss, FHA, Fim, pertactin, BrkA , 
MltA., MltB, VacJ, QmlA and Pqp); BhuR and BapA (optionally with 1, 2, 3, 4 or 5 of 
pertussis toxin, adenylate cyclase, LPS, BipA, Type lUss, FHA, Fim, pertactin, BrkA , 
MltA, MltB, VacJ, QmlA and Pep); BhuR and BapB (optionally with 1, 2, 3, 4 or 5 of 
pertussis toxin, adenylate cyclase, LPS, BipA, Type Hiss, FHA, Fim, pertactin, BrkA , 
MltA, MltB, VacJ, QmlA and ?cp); BhuR and BapC (optionally with 1 , 2, 3, 4 or 5 of 
pertussis toxin, ad^ylate cyclase, LPS, Bq>A, Type Hiss, FHA, Fim, pertactin, BrkA , 
MltA, MltB, VacJ, QmlA and Pep); BhuR and pertactin (optionally wifli 1, 2, 3, 4 or 5 
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of pertussis toxin, adenylate cyclase, LPS, BipA, Type Hiss, FHA, Fim, pertactin, BrkA 
, MltA, MltB, VacJ, QmlA and Pep); BhuR and pertactin-like protein (optionally with 
1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, Type Hiss, FHA, Fim, 
pertactin, BrkA , MltA, MltB, VacJ, OmlA and Pep); BhuR and YapE (optionally with 
1, 2, 3, 4 or 5 of pertussis toxixi, adenylate cyclase, LPS, BipA, Type UIss, FHA, Fim, 
pertactin, BrkA , MltA, MltB, VacJ, OmlA and Pep); BhuR and BrkA (optionally with 
1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, Type niss, FHA, Fim, 
pertactin, BrkA , MltA, MltB, VacJ, QmlA and Pep); BhuR and a lipoprotein selected 
from MltA, MltB, VacJ, QmlA and Pep (optionally with 1, 2, 3, 4 or 5 of pertussis 
toxin, adenylate cyclase, LPS, TcfA, BapA, BapB, BapC, pertacin, pertactin-like 
protein, YapE, BrkA, BipA, Type Diss, Fim and FHA). 

A further prefOTediiomunogenicconttposition of the m^ MltA and at 

least 1, 2, 3, 4, antigens selected from the Hst consisting of pertussis toxin, adenylate 
cyclase, LPS, TcfA, BapA, BapB, BapC, pertactin, pertactin-like protein, YapE, BrkA, 
BipA, Type m ss, BhuR, Fim and FHA. Preferred combinations comprise MltA, and 
pertussis toxin (optionally with 1, 2, 3, 4 or 5 of TcfA, BapA, BapB, BapC, pertactin, 
pertactin-like protein, YapE, BrkA, BipA, FHA, Fim and BhuR); MltA and adenylate 
cyclase (optionally with 1, 2, 3, 4 or 5 of TcfA, BapA, BapB, BapC, pertactin, 
pertactin-like protein, YapE, BrkA, BipA, pertussis toxin, FHA, Fim and BhuR); MltA 
and LPS (optionally with 1, 2, 3, 4 or 5 of TcfA, BapA, BapB, BapC, pertactin, 
pertactin-like protein, YapE, BrkA, BipA, pertussis toxin, FHA, Fim and BhuR); MltA 
and TcfA (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, 
BipA, Type Hiss, FHA, Fim, pertactin, BrkA and BhuR); MltA and BapA (optionally 
with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, Type UIss, FHA, 
Fim, pertactin, BrkA and BhuR); MltA and BapB (optionally with 1, 2, 3, 4 or 5 of 
pertussis toxin, adenylate cyclase, LPS, BipA, Type lUss, FHA, Fim, pertactin, BrkA 
and BhuR); MltA and BapC (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, 
adenylate cyclase, LPS, BipA, Type IHss, FHA, Fim, pertactin, BrkA and BhuR); MltA 
and-pertactin (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, 
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BipA, Type Hiss, FEIA, Fim, BrkA and BhuR); MltA and pertactin-Iike protein 
(optionaUy with 1, 2, 3, 4 or 5 of pertussis toxin, adenjdate cyclase, LPS, BipA, Type 
mss, FHA, Fim, pertactin, BrkA and BhuR); MltA. and YapE (optionally wifli 1, 2, 3, 4 
or 5 of pertussis toxin, adenylate cyclase, LPS, BipA, Type Diss, FHA, Fim, pertactin, 
BikA and BhuR); MtA and BrkA (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, 
adenylate cyclase. UPS, BipA, Type Hiss, FEIA, Fim, pertactin and BhuR); MltA and 
BhuR (optionally with 1, 2, 3, 4 or 5 of pertussis toxin, adenylate cyclase, LPS, TcfA, 
BapA, BapB, BapC, pertacin, pertactin-like protein, YapE, BrkA, BipA, Type Diss, Fim 
and FHA), provided that the combination is not jVfltA and PT and FHA. 

A further preferred immunogenic composition of the invention contains pertussis toxin 
and at least 1, 2, 3, 4, antigens selected from the list consisting of TcfA, BapA, B^B, 
BapC, pertactin, pertactin-like protem, YapE, BrkA, BipA, BhuR, FHA , MltA,MltB, 
VacJ, OmlA and Pep. Preferred combinations conqnise pertussis toxin and TcfA 
(optionally wifli 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA , MltA, 
MltB, VacJ, QmlA and Pep); pertussis toxin and BapA (optionally with 1, 2, 3, 4 or 5 of 
BipA, BhuR, FHA, Fim, pertactin, BrkA , MltA, MltB, VacJ, QmL^ and Pep); pertussis 
toxin and BapB (optionaUy witii 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, 
BikA , MltA, MltB, VacJ, OmlA and Pep); pertussis toxm and BapC (optionally with 
1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA , MltA, MltB, VacJ, OmlA 
and Pep); pertussis toxin and pertactin (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, 
FHA, Fim, BrkA , MltA, MltB, VacJ, OmlA and Pep); pertussis toxin and pertactin- 
like protein (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA 
, MltA, MltB, VacJ, OmlA and Pep); pertussis toxin and YapE (optionally with 1, 2, 3, 
4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA , MltA, MltB, VacJ, OmlA and 
Pq>); patussis toxin and BricA (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, 
Fim, pertactin , MltA, MltB, Va;J, OmlA and Pep); pertussis toxin and BhuR 
(optionaUy wifli 1, 2, 3, 4 or 5 of TcfA, BapA, B^B, BapC, pertactin, pertactin-Uke 
protem, Y^E, BrkA, BipA, FHA, Fim , MltA, MltB, VacJ, OmlA and Pep); perhissis 
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toxin and MltA, MltB (optionally with 1, 2, 3, 4 or 5 of TcfA, BapA, BapB, BapC, 
pertactin, pertactin-like protein, Y^E, BikA, BipA, FHA, Fim and BhuR). 

A furdier preferred imnmnogemc composition of the invention contains adenylate 
cyclase and at least 1, 2, 3, 4, antigens selected ftonai the list consisting of Tc£<5^ BapA, 
BapB, BapC, pertactin, pertactin-like protein, YapE, BrkA, BipA, BhuR, FHA, Fim, 
pertussis toxin , MltA, MltB, VacJ, OmlA and Pop. Preferred combinations comprise 
adenylate cyclase and TcfA. (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, 
pertactin, BrkA, pertussis toxin , MltA, MltB, VacJ, OmIA and Pep); adenylate cyclase 
and BapA (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA, 
pertussis toxin , MltA, MltB, VacJ, OmlA and Pep); adenylate cyclase and BapB 
(optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA, pertussis 
toxin , MltA, MltB, VacJ, OmlA and Pep); adenylate cyclase and B^C (optionally 
with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA, pertussis toxin , MltA, 
MltB, VacJ, OmlA and Pep); adenylate cyclase and pertactin (optionally with 1, 2, 3, 4 
or 5 of BipA, BhuR, FHA, Fun, pertactin, pertussis toxin , MltA, MltB, VacJ, OmlA 
and Pep); adenylate cyclase and pertactin-like protein ((^tionally with 1, 2, 3, 4 or 5 of 
BipA, BhuR, FHA, Fim, pertactin, BrkA, pertussis toxin , MltA, MltB, VacJ, OmlA 
and Pep); adenylate cyclase and YapE (optionally with 1 , 2, 3, 4 or 5 of BipA, BhuR, 
FHA, Fim, pertactin, BrkA, pertussis toxin , MltA, MltB, VacJ, OmlA and Pep); 
adenylate cyclase and BrkA (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, 
pertactin, pertussis toxin , MltA, MltB, VacJ, OmlA and Pep), adenylate cyclase and 
BhuR (optionally with 1, 2, 3, 4 or 5 of TcfA, BapA, BapB, BapC, pertactin, pertactin- 
like protein, YapE, BrkA, BipA, FHA, Fim pertussis toxin , MltA, MltB, VacJ, OmlA 
and Pep); adenjdate cyclase and MltA, MltB (optionally with 1, 2, 3, 4 or 5 of TcfA, 
BapA, B^B, B^C, pertactin, pertactin-like protein, YapE, BikA. BipA, FEIA, Fim, 
pertussis toxin and BhuR). 

A further preferred immunogenic composition of the invention contains LPS and at least 
1, 2, 3, 4, antigens selected jfirom fbe list consisting of TcfA, BapA, Bs^B, BapC, 
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pertactin, partactin-like protem, Y^E, BrkA, BipA, BhuR, FHA, Fim, pertussis toxin , 
MltA, MltB, VacJ, OmlA and ?cp. Piefecred combinations conqwise LPS and TcfA 
(optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA, pertussis 
toxin , MltA, MltB, VacJ, OmlA and Pep); LPS and BapA (optionally with 1, 2, 3, 4 or 
5 of BipA, BhuR, FHA, Fim, pertactin, BikA, pertussis toxin , MltA, MltB, VacJ, 
OmlA and Pep); LPS and BapB (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, 
Fim, pertactin, BrkA, pertussis toxin , MltA, MltB, VacJ, OmlA and Pep); LPS and 
BapC (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, BrkA, 
pertussis toxin , MltA, MltB, VacJ, OmlA and Pep); LPS and pertactin (optionally witii 
1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, BrkA, pertussis toxin , MltA, MltB, VacJ, 
OmlA and Pep); LPS and pertactin-like protein (optionally with 1, 2, 3, 4 or 5 of BipA, 
BhuR, FHA, Fim, pertactin, BrkA, pertussis toxin , MltA, MltB, VacJ, OmlA and Pep); 
LPS and YapE (optionally with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, 
BrkA, pertiissis toxin , MltA, MltB, VacJ, OmlA and Pqp); LPS and BrkA (optionally 
with 1, 2, 3, 4 or 5 of BipA, BhuR, FHA, Fim, pertactin, pertussis toxin , MltA, MltB, 
VacJ, OmlA and Pep); LPS and BhuR (optionally with 1, 2, 3, 4 or 5 of TcfA, BapA, 
BapB, BapC, pertactin, pertactin-like protem, YapE, BikA, BipA, FHA, Fim, pertussis 
toxin , MltA, MltB, VacJ, OmlA and Pep); LPS and MltA (optionally with 1, 2, 3, 4 or 
5 of TcfA, BapA, BapB, BapC, pertactin, pertactin-like protein, YapE, BikA, BipA, 
FHA, Fim and pertussis toxin and BhuR). 

A further preferred combination of the invention contains one, two or three of FHA, 
pertactin, pertussis toxin (preferably FHA and pertactin, FHA and pertussis toxin, 
pertactin and pertussis toxin or FHA, pertactin and pertussis toxin) and an additional 1, 2, 
3 or 4 antigens selected from the group consisting of TcfA, BipA, BapA, BapB, B^C, 
pertactin-like protein, YapE, BhuR, Fim, BrkA, adenylate cyclase. Type m ss , MltA, 
MltB, VacJ, OmlA and Pep. A preferred combinations contains FHA, pertactin, pertussis 
toxin and BhuR. A fiirthCT preferred combination contains FHA, pertactin, pertussis toxin , 
and MltA. A iiirth^ pFeferred combination contains FHA, pertactin, pertussis toxin , and 
MtB. A furdierprefeniedconibination contains FHA, pertactm, pertussis toxin and VacJ. 
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A further preferred combination contains FHA» peitactin, pertussis toxin and OmlA. A 
further preferred combination contains FHA, pertactin, pertussis toxin and Pep. A further 
preferred combination contains FHA, pertactin, pertussis toxin, a lipoprotein (preferably 
MltA, Mltfi, Vac J, QmlA or Pep) and BhuR. 

A preferred immunogenic composition of the invention comprises FHA, pertussis toxin 
and BrkA or a protein sharing at least 70, 80, 90, 95, 97, 98, 99 or 100% identity with 
SEQ ID 34, preferably further comprising pertactin. 

A preferred immunogenic composition of the invention comprises FHA, pertussis toxin 
and BhuR or a protein sharing at least 70, 80, 90, 95, 97, 98, 99 or 100% identity with 
SEQ ID 20, preferably further comprising pertactin. 

A preferred immunogenic composition of the invention conqprises FHA, pertussis toxin 
and BapB or a protein sharing at least 70, 80, 90, 95, 97, 98, 99 or 100% identity with 
SEQ ID 42, preferably further conqirising pertactin. 

A preferred immunogenic composition of the invention comprises FHA, pertussis toxin 
and YapE, preferably further comprising pertactin. 

A preferred immunogenic composition of the invention comprises FHA, pertussis toxin 
and VacJ or a protein sharing at least 70, 80, 90, 95, 97, 98, 99 or 100% identity with 
SEQ ID 82, preferably further comprising pertactin. 

A preferred iromunogenic composition of the invention conq>rises FHA, pertussis toxin 
and Pep or a protein sharing at least 70, 80, 90, 95, 97, 98, 99 or 100% identity with 
SEQ ID 96, preferably further comprising pertactin. 
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A preferred immunogenic conoposilion of the invention comprises FHA, pertussis toxin 
and MltB or a protdn sharing at least 70, 80, 90, 95, 97, 98, 99 or 100% identity with 
SEQ ID 72, preferably further comprising pertactin. 

A preferred immimogenic composition of the invention comprises FHA, pertussis toxin 
and TcfA or a protein sharing at least 70, 80, 90, 95, 97, 98, 99 or 100% identity with 
SEQ ID 36, preferably further comprising pertactin. 

A preferred immimogenic composition of the invention comprises FHA, pertussis toxin 
and adenylate cyclase, preferably further comprising pertactin. 

A preferred immimogenic composition of the invention comprises FHA, pertussis toxin 
and Type HI ss, preferably further comprising pertactin. 

It is further advantageous to combine antigens that are present at di£fer^t stages of 
Bordetella life cycle in the immunogenic compositions of the invention. Bordetella has 
three identifyable stages, during which protein expression is controlled by the bvgAS 
locus. The Bvg^^ virulent phase is characterised by the expression of a number of 
virulence factors including FHA, fimbrae and pertactin, a variety of toxins including 
adenylate cyclase, dermonecrotic toxin and pertussis toxin. During the Bvgi phase, some 
of the virulence factors are expressed and a new set of proteins including BipA are 
expressed (Deora et al Moledular Microbiology (2001) 40; 669-683). 

Immunogenic compositions comprising antigens expressed in diflferent stages of the 
Bordetella life cycle fiirther defines previous embodiments of the invention and is also 
an independent embodiment of the invention. 

Antig^is expressed in the Bvg phases can be determined as set out in Deora et al 
Molecular Microbiology (2001) 40; 669-683; Stockbauer et al Molecular Microbiology 



51 



wo 2005/032584 



PCT/EP2004/011082 



(2001) 39; 65-78; Cotter and MiUer (1994) fijfect. Lnmun. 62; 3381-3390 and Scarlato 
andRappuoU (1991) J. Bacteriol. 173; 7401-7404 and US6387377. 

Currently available acellular pertussis vaccines include FHA, adenylate cyclase, 
pertactin and fbcnbrae proteins which are all BvgH- early genes. Accordingly, a fitrther 
aspect of the invention is an immunogenic composition containing 2, 3, 4, 5, 6, 7, 8, 9, 
or 10 or more antigens which are expressed in two, three or four phases selected from 
Bvg+ early, Bvg+ late, Bvg- and Bvgi, for example, Bvg+ early and Bvg+late; Bvg+ 
early and Bvg-; Bvg+ early and Bvgi: Bvgflate and Bvg-; Bvgf late and Bvgi; Bvg- and 
Bvgi; Bvgf early and Bvg+late and Bvg-; Bvg+ early and Bvg+late and Bvgi; Bvgf 
early and Bvg- and Bvgi; Bvgf late and Bvg- and Bvgi; Bvg+ early and Bvg+late and 
Bvg- and Bvgi. 

FHA, pertussis toxin, adenylate cyclase, Fim and pertacthi are e:q)ressed in Bvg+ early 
phase. 

Vag8, SphBl, Tcf and Type in SS are expressed in Bvgf late phase. 

BipA is e?q>ressed during Bvgi phase. 

LPS are present in all phases including Bvg- phase. 

Accordingly, preferred immunogenic compositions of the invention comprise 1, 2 or 3 
antigens expressed in Bvg+ early phase (preferably selected from FHA, pertussis toxin, 
Fim and pertactin and fiirther comprise 1 , 2 or 3 antigens that are expressed during 
Bvg+ late phase and/or Bvgi phase and/or Bvg- phase (preferably Bvgi). 

Preferred immunogenic compositions conq)rise BipA and an antigen e^q^ressed during 
Bvg+ early phase and/or Bvgf late phase and/or Bvg- phase. 

A preferred immunogenic con:Q>ositions comprises FHA, PT and Tcf (optionally further 
conqjrising 1, 2, or 3 of Fim, pertactin, Vag8, SphBl, Type m SS, BipA and LPS). 
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A preferred inummogenic coirpositions comprises FHA, PT and VagS (optionally 
further con5)rismg 1, 2, or 3 of Fim, pertactin, Tcf, SphBl, Type III SS, BipA and LPS). 

A preferred unmunogenic compositions comprises FHA, PT and VagS (optionally 
further comprising 1, 2, or 3 of Fim, pertactin, Tct SphBl, Type m SS, BipA and 
LPS). 

A preferred immunogenic compositions comprises FHA, PT and SphBl (optionally 
further comprising 1, 2, or 3 of Fim, pertactin, Tcf, VagS, Type III SS, BipA and LPS). 

A preferred immunogenic compositions comprises FHA, PT and Type m SS (optionally 
further comprising 1, 2, or 3 of Fim, pertactin, Tcf, VagS, SphBl, BipA and LPS). 

A preferred immunogenic con:q)ositioiis comprises FHA, PT and BipA (optionally 
further con5>rising 1, 2, or 3 of Fim, pertactin, Tcf, VagS, SphBl, Type III SS, BipA and 
LPS). 

A preferred immunogenic compositions conqprises FHA, PT and LPS (optionally further 
comprising 1, 2, or 3 of Fim, pertactin, Tcf, VagS, SphBl, Type IH SS and BipA). 

The combinations Usted above may be in the form of a subunit vaccine which contains 
isolated, preferably purified antigens. Where this is the case, it is preferred that soluble 
fiagments of some of the antigens are used. For instance, the water soluble passenger 
domain of autotransporter proteins as defined above are preferred. 

A fiirther aspect of the invention is a combination of a protein involved in Bordetella 
resistance to complement (for exan^le BrkA) and an antigen involved in Bordetella 
resistance to cellular immunity (for instance pertussis toxin). Such a combination preferably 
eHcits a protective immune response against Bordetella. This aspect fiirther defines 
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previous embodiments of the invention and is also an independent embodiment of the 
invention. 

A protein involved in Bordetella resistance to complement is defined as a Bordetella 
protein that is capable of dismpting flie effective functioning of the host's complement 
system preferably by inhibiting the classical complemCTit activation pathway. The degree 
of inhibition will be at leMt 10%, preferably 20%, more preferably 30%, more preferably 
40%, 50%, 60%, 70%, 80%, most preferably 90% or 95%. This may be measured by the 
ability of the protein to inhibit a serum killing assay as described in Infect. Immun. 69; 
3067 (2001). Examples of this sort of protein include BrkA and BrkB from Bordetella and 
fragments thereof eliciting an immunogenic response against said proteins, in particular a 
passeng^ domain (approximately from amino acid 41 to amino acid 706). 

A protein involved in Bordetella resistance to cellular immunity is defined as a Bordetella 
protein which is able to inhibit (by at least 30%, 40%, 50%, 60%, 70%, 80%, preferably 
90% or 95%) the effective fimctiomng of at least one type of cell making up the host's 
cellular immunity system. It may act by having a toxic effect on one or more of the host's 
cell populations involved in cellular immunity, for instance T lymphocytes, B 
lymphocytes, neutrophils, eosinophils, macrophages, dendritic cells or monocytes. 
Examples of such antigens include pertussis toxin, adenylate cyclase and LPS. It may 
alternatively inhibit cellular immunity by disrupting the function of cell involved in 
immunity. 

Bordetella pertussis is an obligate human pathogen and has developed mechanisms to 
survive within the hostile environment of the human host. One mechanism of doing this 
is through the action of pertussis toxin, which catalyses the ADP-ribosylation of GTP- 
binding proteins of mammalian cells. Since GTP-binding proteins are signalling 
molecules involved in regulating cellular processes, such ADP-ribosylation can lead to 
dismption of cellular function. Several inq)ortant cells of the immune system including 
neutrophils, macrophages, monocytes and lyn:q)hocytes are inhibited by pertussis toxin 
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(Weiss (1997) ASM News 63; 22). The action of pertussis toxin therefore disables the 
cellular immune response to B. pertussis. 

The complement system is another important defence mechanism in the human body. 
The level of complement in the lung is ordinarily 10-20% of that in serum, however this 
increases during inflammation (Persson (1991) Eur, Respir. 4; 1268). B. pertussis has 
developed mechanisms of evading die complement system. Firstly, the 
lipopoly^accharides ofB, pertussis do not activate the altemative pathway of 
complement (Fernandez and Weiss (1994) Infection and Immunity 62; 4727). The 
binding of antibodies to B. pertussis coiild however, lead to activation of the classical 
complement pathway. B. pertussis has developed a mechanism of iiihibiting the 
classical complement pathway, using the proteiu BrkA. 

An aspect of the invention relates to a pharmaceutical composition, preferably an 
inomunogenic composition, more preferably a vaccine and more preferably an acellular 
vaccine against Bordetella infection, con^rising an antigen involved in Bordetella 
resistance to complement and an antigen involved in Bordetella resistance to cellular 
immimity. Such antigens may be proteins, lipoproteins, polysaccharides, 
Upopolysaccharides or any other constituent of Bordetella. 

In a preferred embodiment of the invention, the pharmaceutical composition comprises 
BrkA and/or BrkB as a protein involved in Bordetella resistance to complement and PT 
and/or adenylate cyclase as a protein involved in Bordetella resistance to cellular 
immunity. Lipopolysaccharides (LPS) are antigens that are also toxic to cells iuvolved 
in iimnunity and in some embodiments of the invention could si9>plement or replace 
pertussis toxin or aden>4ate cyclase. In a further embodiment of the invention, the 
pharmaceutical composition comprises BrkA as a protein involved in Bordetella 
resistance to conq^lement, PT as a protein involved in Bordetella resistance to ceDular 
immunity and FHA. In a further embodiment of the invention, the pharmaceutical 
corrposition comprises BrkA as a protein involved in Bordetella resistance to 
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complemeat, PT as a protein involved in Bordetella resistance to cellular immunity, 
FHA and 69kDa pertactin. 

The pharmaceutical con^ositions of the invention preferably comprise one or more 
additional cross-protective Bordetella antigen. It is advantageous for a vaccine to 
generate protection against A parapertussis as well as A pertussis so that a single 
vaccine can protect against both forms of infection. BrkA is itself well conserved 
between B. pertussis and B, parapertussis, however, a better level of protection is 
achieved by the inclusion of one or more additional antigens which are conserved 
between the several strains of Bordetella. 

Several methods can be used to identify Bordetella cross-reactive antigens. Using 
genome mining, a comparison of the genomes of 5. pertussis and B. parapertussis 
would show which antigCTis are conserved between the two species. Alternatively, DNA 
chips could be used alongside sequence information to assess the expression of 
candidate antigens in B. pertussis and B. parapertussis. Antisera against Pw could be 
used to identify cross-reactive antigens by using gel electrophoresis and western 
blotting. Spot ndcrosequencing could precisely identify cross-reactive antigens. See 
Example 16 for one such suitable method. The invention embodies vaccines containing 
cross-reactive Bordetella antigens identified by the above methods or similar methods 
(jpreferably proteins from SEQ Group 2, most preferably proteins having an amino acid 
sequence having at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% or 100% 
identity to SEQ ID NO: 14, 50, 100. 102, 104, 106, 108 or 110), 

Preferred ratios of antigens for inclusion into a pharmaceutical composition are 1-10 PT 
to i BrkA or BrkB, more preferred ratios are 1-5 PT to 1 BrkA or BrkB, most preferred 
ratios~arc 2.5 PT to 1 BrlsA or BrkB. 

The incorporation of B. pertussis/B. parapertussis crossprotective antigens into the 
ixnmunogemc con^sitions of the invention fixrther defines pre viously described 
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embodiments of the invention and is also an independent eoibodiment of tiie invention, 
namely an inmnmogenic con^osition (or acellular vaccine) conqnising one or more 
antigens ^preferably isolated from eith^ or both of B. pertussis or B. parapertussis^ 
pardcularly those antigens described in Exanqple 16) that is capable of generating an 
inmmne response that is crossreactive against B. pertussis and B. parapertussis, preferably 
ciossprotective against BoideteUa disease, more preferably against A pertussis and B. 
parapertussis disease, with the proviso that the immunogenic composition does not 
con5)rise whole cell pertussis (Pw). 

Diagnostic^ Prognostic, Serotvping and Mntation Assays 

This invention is also related to the use of BASB232 polynucleotides and polypeptides of 
the invention for use as diagnostic reag^ts. Detection of BASB232 polynucleotides and/or 
polypeptides in a eukaryote, particularly a manmial, and especially a human, will provide a 
diagnostic method for diagnosis of disease, staging of disease or response of an infectious 
organism to dmgs. Eukaiyotes, particularly mammals, and especially humans, particularly 
those infected or suspected to be infected with an organism coniprising a BASB232 genes or 
proteins, may be detected at the nucleic add or amino acid level by a variety of well known 
techniques as well as by methods provided herein. 

Polypeptides and polynucleotides for prognosis, diagnosis or other analysis may be obtained 
from a putatively infected and/or infected individual's bodily materials. Polynucleotides 
from any of these soxarces, particularly DNA or RNA, may be used directly for detection or 
may be amplified enzymatically by usiog PCR or any other amplification technique prior to 
analysis. RNA, particularly mRNA, cDNA and genomic DNA may also be used in the 
same ways. Using amplification, characterization of the species and strain of infectious or 
resident organism present in an individual, may be made by an analysis of the genotype of a 
selected polynucleotide of the organism. Deletions and insertions can be detected by a 
change in size of the amplified product in comparison to a genotype of a reference sequence 
selected fix)m a related organism, preferably a different species of the same genus or a 
different strain of the same specie. Point imitations can be identified by hybridizing 
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an5)lified DNA to labeled BASB232 polynucleotide sequences. Perfectiy or significantly 
matched sequences can be distinguished fiom imperfectiiy or more significantly mismatched 
diq>lexes by DNase or RNase digestion, for DNA or KNA respectively, or by detecting 
differences in melting tenq)eratures or reoaturadon kinetics. Polynucleotide sequence 
differences may also be detected by alterations in the electrophoretic mobility of 
polynucleotide fragments in gels as compared to a reference sequrace. This may be carried 
out with or without denaturing agents. Polynucleotide differences inay also be detected by 
direct DNA or RNA sequencing. See, for example, Myers et al. Science, 230: 1242 (1985). 
Sequence changes at specific locations also may be revealed by nuclease protection assays, 
such as RNase, VI and 81 protection assay or a chemical cleavage method. See, for 
example. Cotton et a/., Proc. Natl Acad. ScL, USA, 85: 4397-4401 (1985). 

In another embodiment, an array of ohgoniicleotides probes conoprising BASB232 
nucleotide sequences or fragments thereof can be constructed to conduct efficient screening 
of, for example, genetic mutations, serotype, taxonontiic classification or identification. 
Array technology methods are well known and have general ^plicability and can be used to 
address a variety of questions in molecular genetics including gene expression, genetic 
linkage, and genetic variability (see, for example, Chee et al. Science, 274: 610 (1996)). 

Thus in another aspect, the present invention relates to a diagnostic kit which comprises: 

(a) a polynucleotide of the present invention, preferably any of the nucleotide sequences 
of SEQ Group 1, or a fragment thereof ; 

(b) a nucleotide sequence completnentary to that of (a); 

(c) a polypeptide of the present invention, preferably any of the polypeptides of SEQ 
Group 2 or a fragment thereof; or 

(d) an antibody to a polypeptide of the present invention, preferably to the polypeptides of 
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It will be ^reciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial 
cotDponent. Such a kit will be of use in diagnosing a disease or susceptibility to a 
Disease, among others. 

This invention also relates to the use of polynucleotides of the present invention as 
diagnostic reagents. Detection of a mutated form of a polynucleotide of the invention, 
preferably any sequences of SEQ Group 1 , which is associated with a disease or 
pathogenicity will provide a diagnostic tool that can add to, or defibae, a diagnosis of a 
disease, a prognosis of a course of disease, a determination of a stage of disease, or a 
suscq)tibiKty to a disease, which results from under-expression^ over-expression or altered 
expression of the polynucleotide, Organisnas, particularly infectious organisms, carrying 
mutations in such polynucleotide may be detected at the polynucleotide level by a variety of 
techniques, such as those described elsewhere herein. 

Cells from an organism carrying mutations or polymorphisms (allelic variations) in a 
polynucleotide and/or polypeptide of the invention may also be detected at the 
polynucleotide or polypeptide level by a variety of techniques, to allow for serotyping, for 
example. For example, RT-PCR can be used to detect mutations in the RNA. It is 
particularly preferred to use RT-PCR in conjunction with automated detection S3^ems, such 
as, for example, GeneScan. RNA, cDNA or genomic DNA may also be used for the same 
purpose, PGR. As an example, PGR primers complementary to a polynucleotide encoding 
BASB232 polypeptide can be used to identify and analyze mutations. 

The invention firrflier provides primers with 1, 2, 3 or 4 nucleotides removed from the 5' 
and/or the 3* end. These primers may be used for, among other things, amplifying 
BASB232 DNA and/or RNA isolated from a sa]iq>le d^ved from an individual, such as a 
bodily material. Hie jsrimers may be used to anqdify a polynucleotide isolated from an 
infected individual, such that flie polynucleotide noay then be subject to various techniques 
for elucidation of the polynucleotide sequmce. In this way, mutations in the polynucleotide 
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sequence may be detected and used to diag^ose and/or prognose the infection or its stage or 
course, or to serotype and/or classify ttie infectious agent 

The invention further provides a process for diagnosing, disease, preferably bacterial 
infections, more preferably iofections caused by Bordetalla, particularly B, pertussis, 
comprising determining firom a sample derived from an individual, such as a bodily 
material, an increased level of expression of polynucleotide having a sequence of any of 
the sequences of SEQ Groiq) 1 . Increased or decreased expression of a BASB232 
polynucleotides can be measured usiag any on of the methods well known in the art for 
the quantitation of polynucleotides, such as, for example, amplification, PGR, RT-PCR, 
RNase protection, Norfhem blotting, spectrometry and other hybridization methods. 

In addition, a diagnostic assay in accordance with the invention for detecting over- 
e^qnression of BASB232 polypeptides cornpared to normal control tissue sanq)les may be 
used to detect the presence of an infection, for exaii^le. Assay techniques timt can be used 
to determine levels of BASB232 polypeptides, in a san:q>le derived from a host, such as a 
bodily material, are well-known to those of skill ia the art Such assay methods include 
radioimmunoassays, cor]q>etitive-binding assays, Western Blot analysis, antibody sandwich 
assays, antibody detection and ELDSA assays. 

The polynucleotides of the invention may be used as components of polynucleotide 
arrays, preferably high density arrays or grids. These high density arrays are particularly 
usefril for diagnostic and prognostic pinposes. For example, a set of spots each 
comprisuig a different gene, and further comprising a polynucleotide or polynucleotides 
of the invention, may be used for probing, such as using hybridization or nucleic acid 
aziq>lification, using a probes obtained or derived fix>m a bodily sainple, to determine die 
presence of a particular polynucleotide sequence or related sequence in an individual. 
Such a presence may indicate the presence of a pathogen, particiilarly B, pertussis, and 
may be useful in diagnosing and/or prognosing disease or a course of disease. A grid 
conqirising a number of variants of any polynucleotide sequences of SEQ Qra\xp 1 are 
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prefeirecL Also preferred is a coDoprising a number of variants of a polynucleotide 
sequence encoding any polypeptide sequences of SEQ Group 2 . 

Antibodies 

The polypeptides and polynucleotides of the invention or variants thereof, or cells 
exprrasing Ihe same can be used as immunogens to produce antibodi^ immunospecific for 
such polypeptides or polynucleotides respectively. Alternatively, mimotopes, particularly 
pq)tide mimotopes, of epitopes within the polypeptide sequence may also be used as 
immxmogens to produce antibodies immunospecific for the polypeptide of the invention. 
The term "immunospecific" means that the antibodies have substantially greater afBnity for 
the polypeptides of the mvention than their ajBBnity for other related polypeptides m the prior 
art. 

In certain preferred embodiments of the invention there are provided antibodies against 
BASB232 polypeptides or polynucleotides. 

Antibodies generated against the polypeptides or polynucleotides of the invention can be 
obtained by administOTng the polypeptides and/or polynucleotides of the invention, or 
epitope-bearing firagments of either or both, analogues of either or both, or cells expressing 
either or both, to an animal, preferably a nonhuman, using routine protocols. For 
preparation of monoclonal antibodies, any technique known in the art that provides 
antibodies produced by continuous cell Ime cultures can be used. Examples include various 
techniques, such as those in Kohler, G. and Milstein, C, Nature 256: A95'A91 {X915)\ 
Kozbor et al. Immunology Today 4: 72 (1983); Cole et al, pg. 77-96 in MONOCLONAL 
ANTIBODIES AND CANCER THERAPY, Alan R. liss. Inc. (1985). 

Techniques for tiie production of single chain antibodies (U.S. Patent No. 4,946,778) can be 
adapted to produce single chain antibodies to polypeptides or polynucleotides of this 
invention. Also, transgenic mice, or other organisms or animals, such as other mammals. 
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may be used to express humanized antibodies immunospecific to the polypeptides or 
polynucleotides of the invention. 

Alternatively, phage display technology may be utilized to select antibody genes with 
binding activities towards a polypeptide of tiie invention either from repertoires of PGR 
arc5)lified v-genes of lymphocytes from hmnans screened for possessing anti-BASB232 or 
from naive libraries (McCafiferty, et al, (1990), Nature 348, 552-554; Marks, et al, 
(1992) Biotechnology 10, 779-783). The aflSnity of these antibodies can also be improved 
by, for example, chain shuffling (Clackson et al, (1991) Nature 352: 628). 

The above-described antibodies may be enqjloyed to isolate or to identify clones e^spressing 
the polypeptides or polynucleotides of the invention to purify the polypeptides or 
poljoiucleotides by, for exan^le, afBnity chromatography. 

Thus, among otiiers, antibodies against BASB232 polypeptides or BASB232 
polynucleotides maybe employed to treat infections, particularly bacterial infections. 

Polyp^tide variants include antigenically, epitopically or immunologically equivalent 
variants form a particular aspect of this invention. 

Preferably, the antibody or variant thereof is modified to make it less immunogenic in the 
individual. For example, if the individual is human the antibody may most preferably be 
"humanized," where the complimentarity determining region or regions of the hybndomBr 
derived antibody has been transplanted into a human monoclonal antibody, for example as 
described in Jones et al (1986), Nature 321, 522-525 or Tempest et at, (1991) 
Biotechnology 9, 266-273. 

Antagonists and Agonists - Assays and Molecules 

Polypeptides and polynucleotides of the invention may also be used to assess the binding of 
small uutolbCthle substiates aud ligands in, for exairq)le, cells, cell-free preparations, chemical 
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libraries, and natural product ncdxtuies. Tliese substrates and ligands may be natural 
substrates and ligands or maybe structural or functional mimetics. See, eg:, Coligan et al. 
Current Protocols in Immunology 1(2): CSiapter 5 (1991). 

The screening methods may simply measure the binding of a candidate compound to the 
polypeptide or polynucleotide, or to cells or membranes bearing the polypeptide or 
polynucleotide, or a fusion protein of the polypeptide by means of a label directly or 
indirectly associated with the candidate compound. Alternatively, the screening method 
may involve competition with a labeled competitor. Further, these screening methods 
may test whether the candidate compoxmd results in a signal generated by activation or 
inhibition of the polypeptide or polynucleotide, using detection systems appropriate to the 
cells comprising the polypeptide or polynucleotide. Inhibitors of activation are generally 
assayed in the presence of a known agonist and the eflFect on activation by the agonist by 
the presence of the candidate compound is observed. Constitutively active polypeptide 
and/oT constitutively expressed polypeptides and polynucleotides may be ejcnployed in 
screening methods for inv^e agonists or inhibitors, in the absence of an agonist or 
inhibitor, by testing whether the candidate compound results in inhibition of activation of 
the polypeptide or polynucleotide, as the case may be. Further, the screening methods 
may simply comprise the steps of mixing a candidate compound with a solution 
containing a polypeptide or polynucleotide of the present invention, to form a mixture, 
measuring BASB232 polypeptides and/or polynucleotides activity in the mixture, and 
comparing the BASB232 polypeptides and/or polynucleotides activity of the mixture to a 
standard. Fusion proteins, such as those made from Fc portion and BASB232 
polypeptides, as hereinbefore described, can also be used for high-throughput screening 
assays to identify antagonists of the polypeptide of the present invention, as well as of 
phylogenetically and and/or functionally related polypeptides (see D. Bennett et aL, J Mol 
Recognition, 8:52-58 (1995); and K. Johanson et aL, J Biol Chem, 270(1 6):9459-9471 
(1995)). 
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The polynucleotides, polypeptides and antibodies that bind to and/or mteract with a 
polypeptide of the present invention may also be used to configure screening melliods for 
detecting tiie effect of added conq)ounds on the production of mRNA and/or polypeptide 
in cells. For example, an ELISA assay may be constructed for measuring secreted or cell 
associated levels of polypeptide using monoclonal and polyclonal antibodies by standard 
methods known in the art. This can be used to discover agents which may inhibit or 
enhance the production of polypeptide (also called antagonist or agonist, respectively) 
from suitably manipulated cells or tissues. 

The invention also provides a method of scrcCTing compounds to identify those which 
enhance (agonist) or block (antagonist) the action of BASB232 polypeptide or 
polynucleotides, particularly those con^oimds that are bacteriostatic and/or bactericidal. 
The method of screening may involve high-tbrougbput techniques. For example, to screen 
for agonists or antagonists, a synthetic reaction mix, a cellular conopartment, such as a 
membrane, cell envelope or cell wall, or a preparation of any thereof, comprising BASB232 
polypeptide and a labeled substrate or ligand of such polypeptides is incubated in the 
absence or the presence of a candidate molecule that may be a BASB232 agonist or 
antagonist. The ability of the candidate molecule to agonize or antagonize the BASB232 
polypeptide is reflected in decreased binding of the labeled ligand or decreased production 
of product from such substrate. Molecules that bind gratuitously, i,e.^ without inducing the 
effects of BASB232 polypeptide are most likely to be good antagonists. Molecules that 
bind well and, as the case msxy be, increase the rate of product production from substrate, 
increase signal transduction, or increase chemical channel activity are agonists. Detection of 
the rate or level of, as the case may be, production of product from substrate, signal 
transduction, or chemical channel activity may be enhanced by using a reporter system. 
Reporter systems that may be usefid in this regard include but are not limited to 
colorimetric, labeled substrate converted into product, a reporter gme that is responsive to 




(232 polynucleotide or polypq)tide activity, and binding assays known in 



the art 
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Anotiier eKsanple of an assay for BASB232 agonists is a coirpetitive assay that coinbines 
BASB232 and a potential agonist with BASB232 binding molecxiles, recombinant 
BASB232 binding molecules, natural substrates or li^nds, or substrate or ligand mimetics, 
under appropriate conditions for a con^jetitive inhibition assay, BASB232 can be labeled, 
such as by radioactivity or a colorimetric compound, such that the number of BASB232 
molecules boimd to a binding molecule or converted to product can be determined 
accurately to assess the effectivCTLess of the potential antagonist 

Potential antagonists include, among others, small organic molecules, peptides, polypeptides 
and antibodies that bind to a polynucleotide and/or polypeptide of the invention and thereby 
inhibit or extinguish its activity or eiq^ression. Potential antagonists also may be small 
organic molecules, a peptide^ a polypeptide such as a closely related protein or antibody that 
binds the same sites on a binding molecule, such as a binding molecule, without inducing 
BASB232 induced activities, thereby preventing the action or expression of BASB232 
polypeptides and/or polynucleotides by excluding BASB232 polypeptides and/or 
polynucleotides from binding. 

Potential antagonists include a snudl molecule that binds to and occupies the binding site of 
the polypeptide thereby preventing binding to ceUular binding molecules, such that normal 
biological activity is prevented Examples of small molecules include but are not limited to 
small organic molecules, peptides or peptide-like molecules. Other potential antagonists 
include antisense molecules (see Okano, J. Neurochem. 56: 560 (1991); 
OLIGODEOXYNUCIJSOTmES AS ANTISENSE INHIBITORS OF GENE EXPRESSION, 
CRC Press, Boca Raton, FL (1988), for a description of these molecules). Preferred 
potential antagonists include con^unds related to and variants of BASB232. 

In a further aspect, the present invention relates to genetically engineered soluble fusion 
proteins comprising a polypeptide of the present invention, or a fragmrat tiiweof, and 
various portions of the constant regions of heavy or light chains of immunoglobulins of 
various subclasses (IgG, IgM, IgA, IgE). Preferred as an immunoglobulin is the constant 
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part of the heavy chain of human IgG, particularly IgGl, where fusion takes place at the 
hinge region. In a particular embodiment, the Fc part can be removed sinq>ly by 
incorporation of a cleavage sequence which can be cleaved with blood clotting &ctor Xa. 
Furthermore, this invention relates to processes for the preparation of these fusion 
proteins by genetic engineering, and to the use thereof for drug screening, diagnosis and 
therapy. A further aspect of fee invention also relates to polynucleotides encoding such 
fusion proteios. Examples of fusion protein technology can be found in International 
Patent Application Nos. W094/29458 and W094/22914. 

Each of the polynucleotide sequences provided herein may be used in the discovery and 
development of antibacterial confounds. The encoded protein^ upon expression, can be 
used as a target for the screening of antibacterial drugs. Additionally, the polynucleotide 
sequences encoding the amino temiinal regions of the encoded protein or Shine-Delgamo 
or other translation facilitating sequences of the respective mRNA can be used to 
construct antisense sequences to control the expression of the coding sequence of interest. 

The invention also provides the use of the polypeptide, pol^oiucleotide, agonist or 
antagonist of the invention to interfere with the initial physical interaction between a 
pathogen or pathogens and a eukaryotic, preferably mammalian, host responsible for 
sequelae of infection. In particular, the molecules of the invention may be used: in the 
prevention of adhesion of bacteria, in particular Bordetella, to etikaryotic, preferably 
mammalian, extracellular matrix proteins on in-dwelling devices or to extracellular matrix 
proteins in woimds; to block bacterial adhesion between eukaryotic, preferably 
mammalian, extracellular matrix proteins and bacterial B ASB232 proteins that mediate 
tissue damage and/or; to block the normal progression of pathog^esis in infections 
initiated oth^ than by the implantation of in-dwelling devices or by other surgical 
tedmiques. 

In accordance with yet another aspect of the invention, there are provided BASB232 
agonists and antagonists, preferably bacteristatic or bactericidal agonists and antagonists. 
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The antagonists and agonists of the invention maybe employed, for instance, to prevent, 
inhibit and/or treat disease. 

hi a further aspect, the present invention relates to mimotopes of the polypeptide of the 
invention. A mimotope is a peptide sequence, sufficiently similar to the native antigen, 
preferably a peptide or LPS (sequentially or structurally), which is capable of being 
recognised by antibodies which recognise the native peptide; or is capable of raising 
antibodies which recognise the native peptide when coupled to a suitable carrier. 

Peptide mimotopes may be designed for a particular purpose by addition, deletion or 
substitution of elected amino acids. Thus, the peptides may be modified for the purposes 
of ease of conjugation to a protein carrier. For example, it may be desirable for some 
chemical conjugation methods to include a temcdnal cysteine, hi addition it may be 
desirable for peptides conjugated to a protein carrier to include a hydrophobic terminus 
distal from the conjugated terminus of the peptide, such that the free unconjugated end 
of the peptide remains associated with the surface of the carrier protein. Thereby 
presenting the peptide in a conformation which most closely resembles that of the 
peptide as found in the context of the whole native molecule. For example, the peptides 
maybe altered to have an N-terminal cysteine and a C-tenninal hydrophobic amidated 
tail. Altematively, the addition or substitution of a D-stereoisomer form of one or more 
of the amino acids may be performed to create a beneficial derivative, for example to 
enhance stability of the peptide. 

Altematively, peptide mimotopes maybe identified using antibodies which are capable 
themselves of binding to the polypeptides of the present invention using techniques such 
as phage display technology (EP 0 552 267 Bl). This technique, generates a large number 
of peptide sequences which mimic the stmcture of the native peptides and are, therefore, 
capable of binding to anti-native peptide antibodies, but may not necessarily themselves 
share significant sequence homology to the native polypeptide. 
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Vaccines 

Anotiier aspect of fhe invmtion relates to a method for inducing an immunological 
response in an individual, particularly a mammal.^ preferably humans, which comprises 
inoculating the individual with BASB232 polynucleotide and/or polypeptide, or a 
fragment or variant thereof, or a combination thereof as described above, adequate to 
produce antibody and/ or T cell immune response to protect said individual from 
infection, particularly bacterial infection and most particularly Bordetella infection 
'mchxding B. pertzdssis and/or B. parapef^sis infection. Also provided are methods 
whereby such immunological response slows bacterial replication. Yet another aspect of 
the invention relates to a method of inducing immunological response la an individual 
which cowpiisGS delivering to such individual a nucleic acid vector, sequ^ce or ribozjone 
to direct expression of BASB232 polynucleotide and/or polypeptide, or a fragment or a 
variant thereof, for expressing BASB232 polynucleotide and/or polypeptide, or a fragment 
or a variant thereof, or a combination thereof as described above, in vivo in order to 
induce an immunological response, such as, to produce antibody and/ or T cell immune 
response, including, for example, cytokine-producing T cells or cytotoxic T cells, to 
protect said individual, preferably a human, from disease, whether that disease is ahready 
established within the individual or not. One example of administering the gene is by 
accelerating it into the desired cells as a coating on particles or otherwise. Such nucleic 
acid vector may comprise DNA, RNA, a ribozyme, a modified nucleic acid, a DNA/RNA 
hybrid, a DNA-protein complex or an RNA-protein complex. 

A frirther aspect of the invention relates to an immunological conapositlon that when 
intmduced into an individual, prefi^bly a human, capable of having induced within it an 
immunological response, induces an immunological response in such individual to a 
EASB232 polynucleotide and/or polypeptide encoded th erefrom , or a combination thereof 
as described above, wherein the composition comprises a recombinant BASB232 
polynucleotide and/or polypeptide encoded tiierefrom and/or conq)rises DNA and/or RNA 
whiich encodes and expresses an antigen of said BASB232 polynucleotide, polypeptide 
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encoded tfaerefirom, or other polypq>1ide of the invention. The immunological response 
may be used therapeutically or prophyiactically and may take the form of antibocfy 
immunity and/or cellular immunity, such as cellular immunity arising &om CTL or CD4+ 
T cells. 

A BASB232 polypeptide or a fragment thereof may be fused with co-protein or chemical 
moiety which may or may not by itself produce antibodies, but which is capable of 
stabiUzing the jSrst protein and producing a fused or modified protein which will have 
antigenic and/or immunogenic properties, and preferably protective properties. Thus 
fused recombinant protein, preferably further comprises an antigenic co-protein, such as 
lipoprotein D ftom Haemophilus influenzae, Glutathione-S-transferase (GST) or beta- 
galactosidase, or any other relatively large co-protein which solxibilizes the protein and 
facilitates production and purification thereof. MoreovCT, the co-protein may act as an 
adjuvant in the sense of providing a generalized stimulation of the immune system of the 
organism receiving the protein. The co-protein may be attached to either the amino- or 
carboxy-t^ininiis of the first protein. 

In a vaccine composition according to the invention, a BASB232 polypeptide and/or 
polynucleotide, or a fragment, or a mimotope, or a variant thereof , or a combination thereof 
as described above, may be present in a vector, such as the live recombinant vectors 
described above for example Uve bacterial vectors. 

Also suitable are non-live vectors for the BASB232 polypqjtide, or a combination thereof 
as described above, for exanq}le bacterial outer-membrane vesicles or "blebs", OM blebs 
are derived from the outer membrane of the two-layer membrane of Gram-negative 
bacteria and have been documented in many Gram-negative bacteria (Zhou, L et ah 1998. 
FEMS Microbiol Lett. 163:223-228) including C trachomatis and C psittaci. A non- 
exhaustive list of bacterial pathog^is reported to produce blebs also includes: Bordetella 
pertussis, Borrelia burgdorferi. Brucella melitensis. Brucella ovis, Esherichia coli, 
Haemophilus influenzae, Legionella pneumophila, Moraxella catarrhalis. Neisseria 
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gonorrhoeae. Neisseria meningitidis, Pseudomonas aeruginosa and Yersinia 
enterocolitica. 

Blebs have the advantage of providing outer-membrane proteins in their native 
conformation and are thus particularly useful for vacciaes. Blebs can also be improved 
for vaccine use by engineering the bacterium so as to modify the expression of one or 
more molecules at the outer membrane. Thus for example the expression of a desired 
immunogenic protein at the outer membrane, such as the BASB232 polypeptide, can be 
introduced or upregulated (e.g. by altering the promoter). Instead or in addition, the 
expression of outer-membrane molecules which are either not relevant (e.g. unprotective 
antigens or immunodominant but variable proteins) or detrimental (e.g. toxic molecules 
such as LPS, or potential inducers of an autoimmune response) can be doworegulated. 
These s^proaches are discussed in more detail below. 

The non-coding flanking regions of the BASB232 genes contain regulatory elements 
important m the expression of the gene. This regulation takes place both at the 
transcriptional and translational level. The sequence of these regions, either upstream or 
downstream of the open reading frame of the gene, can be obtained by DNA sequencing. 
This sequence information allows the deteraiination of potential regulatory motifs such as 
the different promoter elements, terminator sequences, inducible sequence elements, 
repressors, elements responsible for phase variation, the shine-dalgamo sequence, regions 
with potential secondary structure involved in regulation, as well as other types of 
regulatory motifs or sequences. This sequence is a further aspect of the invention. 

This sequmce information allows the modulation of the natural egression of the 
BASB232 genes. The upregulation of the gene expression may be accomplished by 
altmnglhe promoter, the shine-dalgamo sequence, potential repressor or operator 
elements, or any other elements involved. Likewise, downregulation of expression can be 
achieved by similar types of modification. Alternatively, by changing phase variation 
sequences, Ihe expression of the gene can be put under phase variation control, or it may 
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be i]ncoiq>led from this regulation. In another approach, the expression of the gene can be 
put under the control of one or more inducible elements allowing regulated expression. 
Examples of such regulation include^ but are not limited to, induction by ten^erature 
shift, addition of inductor substrates like selected carbohydrates or their derivatives, trace 
elements, vitamins, co-factors, metal ions, etc. 

Such modifications as described above can be introduced by several different means. The 
modification of sequences involved in gene expression can be carried out in vivo by 
random mutagenesis followed by selection for the desired phenotype. Another approach 
consists in isolating the region of interest and modifying it by random mutagenesis, or 
site-directed replacement, insertion or deletion mutagenesis. The modified region can then 
be reintroduced into the bacterial genome by homologous recombination, and the effect 
on gene expression can be assessed. In another approach, the sequence knowledge of the 
region of interest can be used to replace or delete all or part of the natural regulatory 
sequences. In this case, the regulatory region targeted is isolated and modified so as to 
contain the regulatory elements from another gene, a combination of regulatory elements 
firom different genes, a synthetic regulatory region, or any other regulatory region, or to 
delete selected parts of the wild-type regulatory sequences. These modified sequences can 
then be reintroduced into the bacterium via homologous recombination into the genome, 
A non-exhaustive list of preferred promoters that could be used for up-regulation of gene 
expression includes the promoters porA, porB, IbpB, tbpB, pi 10, 1st, hpxiAB fcomK 
meningitidis oxN. gonorroheae; ompCD, copB, IbpB, ompE, UspAl; UspA2; TbpB from 
M Caiarrhalis; pi, p2, p4, p5, p6, IpD, tbpB, D15, Hia, Hmwl, Hmw2 firom H. 
influenzae and any known strong promoter firom B. pertussis or B. parapertussis. 
In one example, tibte expression of the gene can be modulated by exchanging its promoter 
with a stronger promoter (through isolating the iq7stream sequence of the gene, in vitro 
modification of this sequence, and reintroduction into the graome by homologous 
recombination). Upregulated expression can be obtained in both the bacterium as well as 
in the outer membrane vesicles shed (or made) fi-om the bact^um. 
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la other exan^les, the described approaches can be used to generate recombinant bacterial 
strains with improved characteristics for vaccine applications. These can be, but are not 
limited to, attenuated strains, strains with increased expression of selected antigens, 
strains with knock-outs (or decreased e?qpression) of genes interfering with the immune 
response, strains with modulated expression of immunodominant proteins, strains with 
modulated shedding of outer-membrane vesicles. 

Thus, also provided by the invention is a modified upstream region of the BASB232 
genes, which modified upstream region contains a heterologous regulatory element which 
alters the expression level of the BASB232 proteins located at the outer membrane. The 
upstream region according to this aspect of the invention includes the sequence upstream 
of the BASB232 genes. The upstream region starts immediately upstream of the BASB232 
genes and continues usually to a position no more than about 1000 bp upstream of the gene 
fix)m the ATG start codon. Li the case of a gene located in a polycistronic sequence 
(operon) the iq)stream region can start immediately preceding the gene of interest, or 
preceding the first gene in the operon. Preferably, a modified upstream region according to 
this aspect of the invention contains a heterologous promotor at a position between 500 and 
700 bp upstream of the ATG. 

Thus, the invention provides BASB232 polypeptides, or a combination thereof as 
described above, in a modified bacterial bleb. The invention further provides modified host 
cells capable of producing the non-live membrane-based bleb vectors. The invention fiirfher 
provides nucleic acid vectors comprising the BASB232 genes having a modified iipstream 
region containing a heterologous regulatory elemmt 

Further provided by the invention are processes to prepare the host cells and bacterial blebs 
according to the invention. 

Also provided by this invention are compositions, particularly vaccine compositions, and 
methods conq)rising the polypeptides and/or polynucleotides of the invention and 
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iimnimostimulatoiy DNA sequences, such as those described in Sato, Y. et aL Science 
273:352 (1996). 

Also, provided by this invention are methods using the described polynucleotide or 
particular fragments thereof, which have been shown to encode non-variable regions of 
bacterial cell surface proteins, in polynucleotide constructs used in such genetic 
immunization experiments in animal models of infection with B. pertussis. Such 
experiments will be particularly useful for identifying protein epitopes able to provoke a 
prophylactic or therapeutic immune response. It is believed that this approach will allow 
for the subsequent preparation of monoclonal antibodies of particular value, derived from 
the requisite organ of the animal successfriUy resisting or clearing infection, for the 
development of prophylactic agents or therapeutic treatments of bacterial infection, 
particularly B. pertussis infection, in manomals, particularly humans. 

Immunogenic compositions 

It is advantageous for the antigens and combinations of antigens of the invention to be 
formulated into immunogenic compositions that comprise immunogenic, preferably 
immunologically effective, amounts of additional antigens to elicit immunity to other 
pathogens, preferably viruses and^or bacteria. Such additional antigens include 
diphtheria toxoid, tetanus toxoid, hepatitis B surface antigen, injectable polio vaccine, 
Haemophilus influenzae type b PRP, capsular polysaccharides or outer membranr 
vesicle preparations from N. meningitidis and capsular polysaccharides from S. 
pneumoniae. 

Preferred immunogenic compositions of the invention are fonmilated with 1, 2, 3 or 
preferably all 4 of the following meningococcal capsular polysaccharides or 
oligosaccharides : A, C, Y or W, which may be plain or conjugated to a protein carrier. 
Combinations of meningococcal polysaccharides or oligosaccharides include A and C; 
A and Y; A and W; C and Y; C and W; Y and W; A, C and Y; A, C and W; A, Y and 
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W; C, Y and W and A, C, Y and W. Such a vaccine containing proteins from K 
meningitidis serogroup B may be advantageously combine a global meningococcus 
vaccine with a Bordetella vaccine. 

Li a further preferred embodiment, the immimogenic compositions of the invention are 
formulated with a conjugated or unconjugated H, influefizae b capsular polysaccharide 
or oligosaccharide, eind one or more plain or conjugated pneumococcal capsular 
polysaccharides or oligosaccarides. Optionally, the vaccine may also comprise one or 
more protein antigens that can protect a host against Streptococcus pneumoniae 
infection. Such a vaccine may be advantageously used as a Bordetella/H. 
influex^e/steeptococcus pneumonia vaccine. 

In a fturther preferred embodiment, the immunogenic composition of the invention is 
formulated with capsular polysaccharides or oligosaccharides derived from one or more 
of Neisseria meningitidis^ Haemophilus influenzae. Streptococcus pneumoniae^ Group 
A Streptococci, Groitp B Streptococci, Staphylococcus aureus or Staphylococcus 
epidermidis. Jn a preferred embodimm^ the immunogenic composition comprises 
capsular polysaccharides or oligosaccharides derived from one or more of serogroups 
A, C, W and Y of Neisseria meningitidis, A further preferred embodiment comprises 
capsular polysaccharides or oligosaccharides derived from Streptococcus pneumoniae. 
The pneumococcal capsxilar polysaccharide or oUgosaccharide antigens are preferably 
selected from serotypes 1, 2, 3, 4, 5, 6B, 7F, 8, 9N, 9V, lOA, llA, 12F, 14, 15B, 17F, 
18C, 19A, 19F, 20, 22F, 23F and 33F (most preferably from serotypes 1, 3, 4, 5, 6B, 7F, 
9V, 14, 18C, 19F and 23F). A further prefeixed embodiment comprises the PRP 
capsular polysaccharides or oligosaccharides of Haemophilus influenzae. A further 
preferred embodiment comprises the Tjpe 5, Type 8 or 336 capsular polysaccharides or 
olrgosaccfaartdes of Stcphylococcus aureus. A further preferred embodiment comprises 
the Type I, Type n or Type HI capsular polysaccharides of Staphylococcus epidennidis. 
A fiotiier preferred enibodiment comprises the Type la. Type Ic, Type H or Type IH 
capsular pulysacuhaiides or ongosaccharides of Ghroup B streptocoocus. A further 
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preferred embodiment comprises the capsular polysaccharides or oligosaccharides of 
Group A streptococcus, preferably further comprising at least one M protein and more 
preferably multiple types of M protein. 

Capsular polysaccharides or oligosaccharides included in pharmaceutical compositions 
of the invention may be unconjugated or conjugated to a carrier protein such as tetanus 
toxoid, tetanus toxoid fragment C, non-toxic mutants of tetaus toxin, diphtheria toxoid, 
CRM197, other non-texic mutant of diphtheria toxin (such as CRM176, CRM 197, 
CRM228, CRM 45 (Uchida et al J. Biol. Chem. 218; 3838-3844, 1973); CRM 9, CRM 
45, CRM102, CRM 103 and CRM107 and other mutations described by Nicholls and 
Youle in Geneticaly Engineered Toxins, Ed: Frankel, Maecel Dekker Inc, 1992; 
deletion or mutation of Glu-148 to Asp, Ghi or Ser and/or Ala 158 to Gly and other 
mutations disclosed in US 4709017 or US 4950740; mutation of at least one or more 
residues Lys 516, Lys 526, Phe 530 and/or Lys 534 and other mutations disclosed in US 
5917017 or US 6455673; or fragment disclosed in US 5843711), pneumolysin or 
Protem D (US6342224), 

The polysaccharide conjugate may be prepared by any known coupling 
technique. For example the polysaccharide can be coiQ)led via a thioether linkage. This 
conjugation method relies on activation of the polysaccharide with l-cyano-4- 
dimefhylamino pyridinium tetrafluoroborate (CDAP) to form a cyanate ester. The 
activated polysaccharide may thus be coupled directly or via a spacer group to an amino 
group on the carrier protein. Preferably, the cyanate ester is coupled with hexane 
diamine and the amino-derivatised polysaccharide is conjugated to the carrier protein 
using heteroligation chemistry involving the formation of the tiiioether linkage. Such 
conjugates are described in PCT published application WO93/15760 Unifomied 
Services University. 

Ihe conjugates can also be prepared by direct reductive amination methods as desCTibed 
in US 4365170 (Jennings) and US 4673574 (Anderson). Other methods are described 
mEP-0-.161-188, EP-208375 andEP-0-477508. 
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A further method involves the coiq)ling of a cyanogen bromide activated 
polysaccharide derivatised with adipic acid hydrazide (ADH) to the protein carrier by 
Carbodiimide condensation (Chu C. et al Infect. Immunity, 1983 245 256). 

The immunogenic compositions of the invention may also comprise proteins j&om other 
pathogens. Preferred pneumococcal proteins antigens are those pneumococcal proteins 
which are exposed on the outer surface of the pneumococcus (capable of being 
recognised by a host's immune system during at least part of the life cycle of the 
pneimiococcus), or are proteins which are secreted or released by the pneumococcus. 
Most preferably, the protein is a toxin, adhesin, 2-component signal tranducer, or 
lipoprotein of Streptococcus pneumoniae^ or fragments thereof. Particularly preferred 
proteins include, but are not limited to: pneumolysin (pref^ably detoxified by chenncal 
treatment or mutation) [Mitchell et al. Nucleic Acids Res. 1990 Julll; 18(13): 4010 
"Conq^arison of pneumolysin genes and proteins from Streptococcus pneumoniae types 
1 and 2.", MitcheU et al Biochim Biophys Acta 1989 Jan 23; 1007(1): 67-72 
"Bxpression of the pneumolysin gene m Escherichia coli: rapid purification and 
biological properties.", WO 96/05859 (A. Cyanamid), WO 90/06951 (Paton et al), WO 
99/03884 (NAVA)]; PspA and transmembrane deletion variants thereof (US 5804193 - 
Briles et aL); PspC and transmembrane deletion variants thereof (WO 97/09994 - Briles 
et al); PsaA and transmembrane deletion variants thereof (Berry & Paton, Infect Immim 
1996 Dec;64(12):5255-62 "Sequmce heterogeneity of PsaA, a 3 7-kiiodalton putative 
adhesin essential for virulence of Streptococcus pneumoniae^y, pneumococcal choline 
binding proteins and transmembrane deletion variants thereof; CbpA and 
transmembrane deletion variants thereof (WO 97/41151; WO 99/51266); 
Glyceraldehyde-3-phosphate - dehydrogenase (Infect Immun. 1996 64:3544); HSP70 
(WO 96/40928); PcpA (Sanchez-Beato et aL FEMS Microbiol Lett 1998, 164:207-14); 
M like protein, (EP 0837130) and adhesin 18627, (EP 0834568). Further preferred 
pneumococcal protein antigens are those disclosed in WO 98/1 8931, particularly those 
selected in WO 98/1 8930 and PCT/US99/30390. 
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Prefiared proteins for inclusioa in the immunogenic composition of the invention 
include adhesins, autotransporter proteins, iron acquisition proteins and toxins &omN. 
meningitidis serotype B, optionally as part of an outear membrane vesicle preparation. 

Adhesins include FhaB (WC)98/02547), NadA (J. ExpMed (2002) 195:1445; NMB 
1994), Hsf also known as NhhA (NMB 0992) (W099/3 1 132), Hap (NMB 
1985)(W099/55873), NspA (W096/29412), MafA (NMB 0652) and MafB (NMB 
0643) (Annu Rev Cell Dev Biol. 16; 423-457 (2000); Nature Biotech 20; 914-921 
(2002)) , Omp26 (NMB 0181), NMB 0315, NMB 0995, NMB 1 1 19 and PUC (Mol. 
Microbiol. 1997, 23; 879-892). These are proteins that are involved in the binding of 
Neisseria to the surface of host cells. 

Autotransporter proteins typically are made up of a signal sequence, a passenger domain 
and an anchoring domain for attachment to the outer membrane. Examples of 
autotransporter proteins include Hsf (W099/3 1132) (NMB 0992), HMW, Hia (van 
XJlsen et al Immunol. Med. Microbiol. 2001 32; 53-64), Hap (NMB 1985) 
(W099/55873; van Ulsen et al Immunol. Med. Microbiol. 2001 32; 53-64), UspA, 
UspA2, NadA (NMB 1994) (Comanducci et al J. Exp. Med. 2002 125; 1445-1454), 
AspA (Infection and Immunity 2002, 70(8); 4447-4461; NMB 1029), Aida-1 like 
p)x>tein, SSh-2 and Tsh. The passenger domain of an autotransporter protein is a • 
preferred fragment for incorporation into the immunogenic composition of the 
invention. 

Iron aquisition proteins include TbpA (NMB 0461) (WO92/03467, US5912336, 
WO93/06861 and EP586266), TbpB (NMB 0460) (WO93/06861 and EP586266), LbpA 
(NMB 1540) (Med Microbiol (1999) 32:1117), LbpB (NMB 1541)(WO/99/09176), 
HpuA (U73112.2) (Mol MicrobioL 1997, 23; 737-749), HpuB (NC_003116.1) (Mol 
Microbiol. 1997, 23; 737-749), P2086 also known as XthA (NMB 0399) (13* 
International Pathogenic Neisseria Conference 2002), FbpA (NMB 0634), FbpB, BfrA 
(NMB 1207), BfrB (NMB 1206), Lipo28 also known as GNA2132 (NMB 2132), Sibp 
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(NMB 1882), HmbR, HemH, Bcp (NMB 0750), Iron (HI) ABC transporter-permease 
protein (Tettelin et al Science 287; 1809-1815 2000), Iron (lH) ABC transporter- 
periplasmic (Tettelin et al Science 287; 1809-1815 2000), TonB-dependent receptor 
(NMB 0964 and NMB 0293)(Tettelin et al Science 287; 1809-1815 2000) and 
transferrin binding protein related protein (Tettelin et al Science 287; 1809-1815 2000). 

Toxins include FrpA (NMB 0585; NMB 1405), FrpA/C (see below for definition), 
FrpC ( NMB 1415; NMB 1405) (WO92/01460), NM-ADPRT (NMB 1343) (13* 
Intemational Pathogenic Neisseria Conference 2002 Masignani et al pi 35), VapD 
(NMB 1753), lipopolysaccharide (LPS; also called lipooligosaccharide or LOS) 
immunotype L2 and LPS immunotype L3. FrpA and FipC contain a region which is 
conserved between these two proteins and a preferred firagment of the proteins would be 
a polypeptide containing this conserved firagment, preferably comprising amino acids 
227-1004 of the sequence of FrpA/C. 

The meningococcal proteins included ia the immunogenic conq)osition of the invmtion 
may be present as a subunit composition in which the purified protein or an 
immunogenic firagment of the protein is added to the immunogenic composition. 
Optionally, the protein is added as part of an outer membrane vesicle preparation. 

The immxmogenic composition optionally comprises antigens providing protection 
against Diphtheria and/or tetanus infections. Typically, the antigens providing protection 
against Diphtheria and tetanus would be Diphtheria toxoid and tetanus toxoid. The 
toxoids may be chemically inactivated toxins or toxins inactivated by the introduction of 
point mutations. 

It is advantageous to combine the immunogenic composition of the invention with 
antigens that confer immunity against one or more of Haemqphiltds influenzae b, 
hepatitis B and/or polio virus. Preferred pharmaceutical compositions of the invention 
will further comprise one or more , most preferably all three of PRP polysaccharide or 
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oligosaccharide of Haemophilus influenzae b, hepatitis B surface antigen and/or 
injectable poUo virus (IPV). 

The immunogenic consposition optionally conq)rises one or more antigens that can 
protect a host against RS V and/or one or more antigens that can protect a host against 
influenza virus. 

Preferred influenza virus antigens include whole, live or inactivated virus, split 
influenza virus, grown in eggs or MDCK cells, or Vero cells or whole flu virosomes (as 
described by R. Gluck, Vaccine, 1992, 10, 915-920) or purified or recombinant proteins 
thereof, such as HA, NP, NA, or M proteins, or combinations thereof 
Preferred RS V (Respiratory Syncytial Virus) antigens include the F glycoprotein, tiie G 
glycoprotein, the HN protein, or derivatives thereof. 

Preferred non-typeable H. influenzae protein antigens include Fimbrin protein (US 
5766608) and fiisions con^rising peptides therefrom (eg LBl Fusion) (US 5843464 - 
Ohio State Research Foundation), OMP26, P6, protein D, TbpA, TbpB, Hia, Hmwl, 
Hmw2, Hap, and D15. 

It should be appreciated that immunogenic compositions of the invention may comprise 
one or more capsular polysaccharide or oligosaccharide from a single species of bacteria. 
Immunogenic compositions may also comprise capsular polysaccharides or 
oUgosaccharide derived from one or more species of bacteria. 

Vaccines 

A ftirfher embodiment of the invention provides a vaccine formulation which conq)rises 
an immunogenic recombinant polypeptide and/or polynucleotide of the invention, or a 
combination thereof, together with a suitable carrier/excipient, such as a phannaceutically 
acceptable carrier/excipient Since the polypeptides and polynucleotides may be broken 
down in the stomach, each is preferably administered parenterally, including, for example, 
adnainistration that is subcutaneous, intramuscular, intravenous, or intradermal. 
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Fommlations suitable for parenteral administration include aqueous and non-aqueous 
sterile injection solutions which may contain anti-oxidants, buffers, bacteriostatic 
conq>ounds and solutes which render the formulation isotonic with the bodily fluid, 
preferably the blood, of the individual; and aqaeous and non-aqueous sterile suspensions 
which may include suspending agents or thickening agents. The formulations may be 
presented in unit-dose or multi-dose contains, for example, sealed ampoules and vials 
and may be stored in a Jfreeze-dried condition requiring only the addition of the sterile 
Uquid carrier immediately prior to use. 

The vaccine formulation of the invention may also include adjuvant systems for 
enhancing the immunogenicity of the formulation. The adjuvant may be aluminium 
hydroxide, almninium phosphate or a mixture of aluminium hydroxide and alimiinium 
phosphate. Where hepatitis B surface antigen is present as part of the vaccine, the 
adjuvant is preferably aluminium phosphate. Preferably the adjuvant systCTa raises 
preferentially a THl type of response. 

An inomune response may be broadly distinguished into two extreme catagories, being a 
humoral or cell mediated immune responses (traditionally characterised by antibody and 
cellular effector mechanisms of protection respectively). These categories of response 
have been termed THl -type responses (cell-mediated response), and TH2-type immune 
responses (humoral response). 

Extreme THl -type immune responses may be characterised by the generation of antigen 
specific, haplotype restricted cytotoxic T lymphocytes, and natural killer cell responses. 
In mice THl -type responses are often characterised by the generation of antibodies of 
the IgG2a subtype, whilst in the human these correspond to IgQl type antibodies. TH2- 
^e immune responses are characterised by the generation of a broad range of 
immunoglohnlin isotypes including in mice IgGl, IgA, and IgM. 
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It can be considered that the driving force behind the development of these two types of 
inmnine responses are cytokines. High levels of THl-type cytokines tend to favour the 
induction of cell mediated immune responses to the given antigen, whilst hi^ levels of 
TH2-type cytokines tend to favour the induction of humoral immune responses to the 
antigen. 

The distinction of THl and TH2-type immme responses is not absolute, hi reality an 
individxial will support an immune response which is described as being predominantly 
THl or predominantly TH2. However, it is often convenient to consider the families of 
cytokines in terms of that described in murine CD4 4-ve T cell clones by Mosmann and 
Coffinan {Mosmann, T.R and Cqffman, R.L. (J 989) THl and TH2 cells: different 
patterns oflymphokine secretion lead to different functional properties. Annual Review 
of Immunology, 7, />7-^5-7 75). Traditionally, THl-type responses are associated with 
the production of tiie INF-y and IL-2 cytokines by T-lymphocytes. Other cytokines often 
directly associated with the induction of THl-type immune responses are not produced 
by T-cells, such as lL-12. hi contrast, TH2- type responses are associated with the 
secretion of IL-4, IL-5, IL-6 and IL-13. 

It is known that certain vaccine adjuvants are particularly suited to the stimulation of 
either THl or TH2 - type cytokine responses. Traditionally the best indicators of the 
THl :TH2 balance of the immxme response after a vaccination or infection includes 
dhrect measurement of the production of THl or TH2 cytokines by T lymphocytes in 
vitro after restimulation with antigen, and/or the measurement of the IgGl:IgG2a ratio 
of antigen specific antibody responses. 

Thus, a THl-type adjuvant is one which preferentially stimulates isolated T-cell 
populations to produce high levels of THl-type cytokmes when re-stimulated with 
antigen in vitro, and promotes development of both CD8+ cytotoxic T lymphocytes and 
antigen specific immimoglobulin responses associated with THl-type isotype. 
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Adjuvants which are capable of prefereatial stimulation of the THl cell response are 
described in Ihtemational Patent Application No. WO 94/00153 and WO 95/17209, 

3 De-O-acylated monophosphoryl lipid A (3D-MPL) is one such adjuvant. This is 
known from GB 222021 1 (Ribi). Chemically it is a mixture of 3 De-O-acylated 
monophosphoryl lipid A with 4, 5 or 6 acylated chains and is manufactured by Ribi 
hnmunochem, Montana. A preferred form of 3 De-O-acylated monopho^horyl lipid A 
is disclosed in European Patent 0 689 454 Bl (SmithKline Beecham Biologicals SA). 

Preferably, the particles of 3D-MPL are small enough to be sterile filtered through a 
0.22micron membrane (Etiropean Patent number 0 689 454). 
3D-MPL will be present in the range of 10(ig - lOO^g preferably 25-50jLig per dose 
wherein the antigen will typically be present in a range 2-50|j,g per dose. 

Another preferred adjuvant comprises QS21, an Hplc purified non-toxic firaction derived 
firom the bark of Quillaja Saponaria Molina. Optionally this may be admixed with 3 
De-O-acylated monophosphoryl lipid A (3D-MPLX optionally together with an carrier. 

The method of production of QS21 is disclosed in US patent No. 5,057,540, 

Non-reactogenic adjuvant formulations containing QS21 have been described 
previously (WO 96/33739). Such formulations conqmsing QS21 and cholesterol have 
been shown to be successful THl stimulating adjuvants when formulated together with 
an antigen. 

Further adjuvants which are preferential stimulators of THl cell response include 
immunomodulatory oligonucleotides, for example unmethylated CpG sequences as 
(tisolosed in WO 96/02555. 
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Combinations of different THl stimulating adjuvants, such as those mentioned 
hereinabove, are also contenq)lated as providing an adjuvant which is a preferential 
stimulator of THl cell response. For example, QS21 can be formulated together with 
3D-MPL. The ratio of QS21 : 3D-MPL will typicaUy be in the order of 1 : 10 to 10 : 1; 
preferably 1 :5 to 5 : 1 and often substantially 1:1. The preferred range for optimal 
synergy is 2.5 : 1 to 1 : 1 3D-MPL; QS21. 

Preferably a carrier is also present in the vaccine conqjosition according to the 
mveution. The carrier may be an oil in water emulsion, or an aluminium salt, such as 
aluminium phosphate or altuninium hydroxide. 

A preferred oil-in-water emulsion conq>rises a metaboUsible oil, such as squalene, alpha 
tocopherol and Tween 80. In a particularly preferred aspect the antigens m the vaccme 
conqjosition accordmg to the mvention are combmed witii QS21 and 3D-MPL in such 
an emulsion. Additionally the oil in water emulsion may contam span 85 and/or lecithin 
and/or tricaprylin. 

Typically for human administration QS21 and 3D-MPL wiU be present in a vaccine in 
the range of l^g - 200ng, such as lO-lOO^g, preferably lOjig - SOp-g per dose. 
Typically the oil m water will comprise from 2 to 10% squalene, from 2 to 10% alpha 
tocopherol and from 0.3 to 3% tween 80. Preferably the ratio of squalene: alpha 
tocopherol is equal to or less than 1 as this provides a more stable emulsion. Span 85 
may also be present at a level of 1%. In some cases it may be advantageous that the 
vaccines of the present mvention will fiirther contain a stabiliser. 



leor 



Non-toxic oil in water emulsions preferably contain a non-toxic oil, e.g. squalane 
squalene, an emulsifier, e.g. Tween 80, in an aqueous carrier. The aqueous carrier may 
be, for exarople, phosphate buffered saline. 
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A particularly potent adjuvant formulation involving QS21, 3D-MPL and tocopherol in 
an oil in water emulsion is described in WO 95/17210. 

Wbile the iavmtion has been described with reference to certain BASB232 polypq)tides 
and polynucleotides, it is to be understood that this covers fragments of the naturally 
occurring polypeptides and polynucleotides, and similar polypeptides and polyaucleotides 
with additions, deletions or substitutions which do not substantially affect the 
immunogenic properties of the recombinant polypeptides or polynucleotides. 

Compositions^ kits and administration 

In a further aspect of the invention there are provided con^jositions comprising a BASB232 
polyuucleotide and/or a BASB232 polypeptide for administration to a cell or to a 
multicellular organism. 

The invention also relates to compositions conqmsing a polynucleotide and/or a 
polypeptides discussed herein or their agonists or antagonists. The polypeptides and 
polynucleotides of the invention maybe employed in combination with a non-sterile or 
sterile carrier or carriers for use with cells, tissues or organisms, such as a pharmaceutical 
carrier suitable for administration to an individual. Such compositions conq[>rise, for 
instance, a media additive or a tiierapeutically effective amount of a polypeptide and/or 
polynucleotide of the invention and a pharmaceutically acceptable carrier or excipient. Such 
carriers may include, but are not limited to, saline, buffered saline, dextrose, water, glycerol, 
ethanol and combinations thereof. The formulation should suit the mode of administration. 
The invention fiirther relates to diagnostic and pharmaceutical packs and kits comprising 
one or more containers filled with 1, 2, 3^ 4 or 5 of the ingredients of the aforemmtioned 
conQ}ositions of the inventioiL 

Polypeptides, polynucleotides and other conqpounds of the invention may be eirployed 
alone or in conjunction with other confounds, such as therapeutic compounds. 
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Tlie phannaceutical con^ositions may be administered in any effective, convenient manner 
including, for instance, administration by topical, oral, anal, vaginal, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal routes among others. 

In therapy or as a prophylactic, the active agent may be administered to an individual as 
an injectable con:^osition, for example as a sterile aqueous dispersion, preferably isotonic. 

In a further aspect, the present invention provides for pharmaceutical con:^ositions 
con:5)rising a th^^utically effective amount of a polypeptide and/or polynucleotide, such 
as the soluble form of a polypeptide and/or polyaucleotide of the present iavention, agonist 
or antagonist peptide or small molecule conqjound, in combination with a pharmaceuticaUy 
acceptable canier or excipient Such carriers include, but are not limited to, saline, buffered 
saline, dextrose, water, glycerol, ethanol, and combinations thereof. The invention further 
relates to pharmaceutical packs and kits comprising one or more containers filled with one 
or more of the ingredients of the aforementioned conopositions of the invention. 
Polypeptides, polynucleotides and other compounds of the present invention may be 
ernployed alone or in conjunction with other con9X)iinds, such as ther^eutic conq>ounds. 

The conqiosition will be adapted to the route of administration, for instance by a systemic or 
an oral route. Preferred forms of systemic administration include injection, typically by 
intravenous injection. Other injection routes, such as subcutaneous, intramuscular, or 
intraperitoneal, can be used. Alternative means for systemic administration include 
transmucosal and transdamal administration using penetrants such as bile salts or fusidic 
acids or other detergents. In addition, if a polypeptide or other compoxmds of the present 
invention can be formulated in an enteric or an encapsulated formulation, oral 
administration may also be possible. Administration of these compoimds may also be 
topical and/or localized, in the form of salves, pastes, gels, solutions, powders and the like. 

For administration to mammals, and particularly humans, it is expected tiiat the daily 
dosage level of the active agent will be from 0.01 mg/kg to 10 mg/kg, typically around 1 
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mg/1cg. The physician in any event will determine the actual dosage which will be most 
suitable for an individual and will vaiy with the age, weight and response of the particular 
individual. The above dosages are exemplary of the average case. There can, of course, 
be individual instances where higher or lower dosage ranges are merited, and such are 
within the scope of this invention. 

The dosage range required depends on the choice of peptide, the route of administration, the 
nature of the formulation, the nature of the subject's condition, and the judgment of the 
attending practitioner. Suitable dosages, however, are in the range of 0. 1-100 jig/kg of 
subject. 

A vaccine composition is convenienfly in injectable form. Conventional adjuvants may be 
employed to enhance the immune response. A suitable unit dose for vaccination is 0.5-5 
microgram/kg of antigen, and such dose is preferably administered 1-3 times and with an 
interval of 1-3 weeks. With the indicated dose range, no adverse toxicological effects will 
be observed with the conopounds of the invention which would preclude their 
administration to suitable individuals. 

Wide variations in flie needed dosage, however, are to be expected in view of the variety of 
confounds available and the differing efBciencies of various routes of administration. For 
example, oral administration would be expected to require higher dosages than 
administration by intravenous injection. Variations in these dosage levels can be adjusted 
using standard empirical routines for optimization, as is well understood in the art. 

Sequence Da^ases. Sequences in a Tangible Medium, and Algorithms 
Polynucleotide and polypeptide sequ^ces form a valuable information resource with which 
to determine-thdr 2- and 3-dimensional stmctures as well as to identify fiirflier sequences of 
similar homology. These approaches are most easily facilitated by storing the sequence in a 
computer readable mediimi and then using the stored data in a known macromolecular 
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structuie program or to search a sequence database using well known searching tools, such 
as the GCG program package. 

Also provided by the invention are methods for the analysis of character sequences or 
strings, particularly genetic sequences or encoded protein sequences. Preferred methods 
of sequence analysis include, for example, methods of sequrace homology analysis, such 
as identity and similarity analysis, DNA, RNA and protein structure analysis, sequence 
assembly, cladistic analysis, sequence motif analysis, open reading frame determination, 
nucleic acid base calling, codon usage analysis, nucleic acid base trimming, and 
sequenciQg chromatogram peak analysis. 

A computer based method is provided for performing homology identification. This 
method comprises the steps of: providing a first polynucleotide sequence conqwising the 
sequence of a polynxicleotide of die invention in a conoputer readable medium; and 
conqjaring said first polynucleotide sequence to at least one second polynucleotide or 
polypeptide sequence to identify homology. 

A computer based method is also provided for performing homology identification, said 
method comprising the steps of: providing a first polypeptide sequence comprising the 
sequence of a polypeptide of the invention in a computer readable medium; and 
con5)aring said first polypeptide sequence to at least one second polynucleotide or 
polypeptide sequence to identify homology. 

All publications and references, including but not limited to patents and patent 
applications, cited in this specification are herein incorporated by reference in their 
entirety as if each individual publication or reference were specifically and individually 
indicated to be incorporated by reference herein as bemg fiiUy set forth. Any patent 
application to which this application clauns priority is also mcorporated by reference 
herein in its entirety in the manner described above for publications and references. 
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DEFENmONS 

"Identity," as known in the art, is a relationship between two or more polypeptide sequences 
or two or more polynucleotide sequences, as the case may be, as detemiined by comparing 
the sequences, hi the art, "identity" also means the degree of sequence relatedness between 
poljfpeptide or polynucleotide sequences, as the case may be, as determined by the match 
betweai strings of such sequences. "Identity" can be readily calculated by known 
methods, including but not limited to those described in (Computational Molecular 
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data^ Part I, Griffin, A.M., and GiifBn, H,G., eds., 
Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology^ von Heine, 
G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., 
eds., M Stockton Press, New York, 1991 ; and Carillo, H., and Lipman, D., SIAM J. 
Applied MatK 48: 1073 (1988). Methods to determine identity are designed to give the 
largest match between die sequences tested. Moreover, methods to determine identity are 
codified in publicly available conq>uter programs. Computer program methods to 
determine id^tity between two sequences include, but are not limited to, the GAP 
program in the GCG program package (Devereux, J., et al., Nucleic Acids Research 1 2(1 ): 
387 (1984)), BLASTP, BLASTN (Altschul, S.R et al., J. Molec. Biol 215: 403-410 
(1990), and FASTA( Pearson and Lipman Proc. Natl. Acad. Sci. USA 85; 2444-2448 
(1988). The BLAST family of programs is publicly available from NCBI and other 
sources {BLAST Manual, Altschul, S., et al, NCBI NLM NIH Bethesda, MD 20894; 
Altschul, S., et al, J. Mol Biol 215: 403-410 (1990). The well known Smith Waterman 
algorithm may also be used to detranine identity. 

Paiameleis for polypeptide sequence comparison include the following: 
Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
Conqjarison matrix: BLOSSUM62 from Henikoff and Henikoflf, 
Proc: Nad. Acad. Sci USA. 89:10915-10919 (1992) 
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Gap Penalty: 8 

Gap Length Penalty: 2 

A program iiseM with these parameters is publicly available as the "g^" program fix>m 
Genetics Computer Group, Madison WL The aforementioned parameters are the default 
parameters for pq>tide con^^arisons (along with no penalty for end gaps). 

Paramet^ for polynucleotide comparison include the following: 
Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
Comparison matrix: matches = +10» mismatch = 0 
Gap Penalty: 50 
Gap Length Penalty: 3 

Available as: The "gap" program from Genetics Computer Group, Madison WL These 
are the default parameters for nucleic acid comparisons. 

A preferred meaning for "identity" for polynucleotides and polypeptides, as the case may 
be, are provided in (1) and (2) below. 

(1) Polynucleotide embodiments further include an isolated polynucleotide 
comprising a polynucleotide sequence having at least a SO, 60, 70, 80, 85, 90, 95, 97 or 
100% identity to the reference sequence of SEQ ID NO: 1, wherein said polynucleotide 
sequence may be identical to the reference sequence of SEQ ID NO: 1 or may include up 
to a certain integer number of nucleotide alterations as compared to the reference 
sequence, wherein said alterations are selected from the groiq> consisting of at least one 
nucleotide deletion, substitution, including transition and transversion, or insertion, and 
wherein said alterations may occur at the 5' or 3* terminal positions of the reference 
nucleotide sequence or anywhere between those temiinal positions, interspersed either 
individually among the nucleotides in the rdS^CTice sequmce or in one or more 
contiguous groups within the reference sequence, and wherein said number of nucleotide 
alterations is determined by multiplying the total nimiber of nucleotides in SEQ ID NO: 1 
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by the integer defining the percent identity divided by 100 and then subtracting that 
product fronoi said total number of nucleotides in SEQ ID NO:l, or: 

nn^Xn-(xn*y), 

wherein n^ is the number of nucleotide alterations, % is the total number of nucleotides 
in SEQ ID NO:l, y is 0.50 for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 0.85 for 
85%, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and • is the symbol for 
the multiplication operator, and wherein any non-integer product of Xj^ and y is rounded 
down to the nearest integer prior to subtracting it firom x^. Alterations of a polynucleotide 
sequence encoding the polypeptide of SEQ ID NO:2 may create nonsense, missense or 
firameshift mutations in tiiis coding sequence and thereby alter the polypeptide encoded by 
the polynucleotide following such alterations. 

By way of example, a polynucleotide sequence of tiie present invention may be idratical 
to the reference sequence of SEQ ID NO: 1, that is it may be 100% identical, or it may 
include up to a certain integer number of nucleic acid alterations as compared to the 
reference sequence such that the percent identity is less than 100% identity. Such 
alterations are selected from the group consisting of at least one nucleic acid deletion, 
substitution, including transition and transversion, or insertion, and wherein said 
alterations may occxir at the 5' or 3' terminal positions of the reference polynucleotide 
sequence or anywhere between those terminal positions, interspersed either individually 
among the nucleic acids in the reference sequence or in one or more contiguous groiq)s 
within the reference sequence. The number of nucleic acid alterations for a given percent 
identity is detemiined by multiplying the total number of nucleic acids in SEQ ID NO: 1 
by the integer defining the percent identity divided by 100 and then subtracting tiiat 
product from said total number of nucleic acids in SEQ ID NO: 1, on 

nn^Xn-(Xn*yX 
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whereiii Un is the numb^ of nucleic acid alterations^ Xjx ^ nmnber of nucleic 

acids in SEQ ID NO:l, y is, for instance 0-70 for 70%, 0.80 for 80%, 0.85 for 85% etc., • 
is the symbol for the multiplication operator, and wherein any non-integer product of 
and y is rounded down to the nearest integer prior to subtracting it from x^. 

(2) Polypeptide embodimCTts fiirther include an isolated polypeptide comprisiiig a 
polypeptide having at least a 50,60, 70, 80, 85, 90, 95, 97 or 100% identity to a 
polypeptide reference sequence of SEQ ID NO:2, wherein said polypeptide sequence may 
be identical to the reference sequence of SEQ ID NO:2 or may include iip to a certain 
integer number of amino acid alterations as compared to the reference sequence, wherein 
said alterations are selected from the group consisting of at least one amino acid deletion, 
substitution, including conservative and non-conservative substitution, or insertion, and 
wherein said alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sjsquence or anywhere between those terminal positions, 
interspersed either individually among the amino acids in the reference sequence or in one 
or more contiguous groups within the reference sequence, and wherein said nimiber of 
amino acid alterations is determined by multiplying the total number of amino acids in 
SEQ ID NO:2 by the integer defining the percent identity divided by 1 00 and then 
subtracting that product from said total number of amino acids in SEQ ID NO:2, or: 

na^Xa-(Xa»y), 

wherein n^ number of amino acid alterations, x^ is the total number of amino acids 
in SEQ ID NO;2, y is 0.50 for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 0.85 for 
85%, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and • is flie symbol for 
the multiplication operator, and wherein any non-integer product of x^ and y is roimded 
down to the nearest integer prior to subtracting it from Xa- 
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By way of example, a polypeptide sequence of the present invention may be identical to 
the reference sequence of SEQ ID NO:2, that is it may be 100% identical^ or it may 
include tip to a certain integer number of amino acid alterations as con:q)ared to the 
reference sequence such that the p^cent identity is less than 100% identity. Such 
alterations are selected from the gcoiqp consisting of at les^t one amino acid deletion, 
substitution, including conservative and non-conservative substitution, or insertion, and 
wherein said alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sequence or anywhere between those terminal positions, 
interspersed either individually among the amino acids in the reference sequence or in one 
or more contiguous groiq)S within the reference sequence. The number of amino acid 
alterations for a given % identity is determined by multiplying the total number of amino 
acids in SEQ ID NO:2 by the integer defining the percrat identity divided by 100 and then 
subtracting that product from said total number of amino acids in SEQ ID NO:2, or: 



na^xa-(xa«y). 



wherein n^ is the number of amino acid alterations, is the total number of amino acids 
in SEQ ID NO:2, y is, for instance 0.70 for 70%, 0.80 for 80%, 0.85 for 85% etc., and • is 
the symbol for the multiplication operator, and wherein any non-integer product of and 
y is rounded down to the nearest integer prior to subtracting it from x^. 

The terms "comprising", "comprise" and "comprises" herein is intended by the inventors to 
be optionally substitutable with the terms "consisting of", "consist of\ and "consists of, 
respectively, in every instance. 

^Immunogenic composition" in the context of a polynucleotide means that vsdien the 
polynucleotide is introduced into a host and protein is expresssed from that 
polynu^eetid&, the-expressed protdn is immimogenic. 
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"Individiial(s)," when used herein with reference to an organism, means a multicellular 
eukaryote, including, but not limited to a metazoan, a mammal, an ovid, a bovid, a simian, 
aprimate, and a himian. 

"Isolated" means altered *lyy the hand of man" from its natural state, /.e, if it occuib in 
nature, it has been changed or removed from its original environment, or both. For example, 
a polynucleotide or a polypeptide naturally present in a living organism is not "isolated," but 
the same polynucleotide or polypeptide separated from the coexisting materials of its natural 
state is "isolated", as the term is employed herein. Moreover, a polynucleotide or 
polypeptide that is introduced into an organism by transformation, genetic manipulation or 
by any other recombinant mefliod is "isolated" even if it is still present in said organism, 
which organism n^y be living or non-hving. 

"Polynucleotide(s)" generally rrfers to any polyribonucleotide or polydeoxyribonucleotide, 
which maybe unmodified RNA or DNA or modified RNA or DNA including single and 
double-stranded re^ons. 

"Toxin" preferably includes a toxoid form of the toxin. 

"Variant" refers to a polynucleotide or polj^eptide that differs from a reference 
polynucleotide or polypeptide, but retains ess^tial properties. A typical variant of a 
polynucleotide differs in nucleotide sequence from another, reference polynucleotide. 
Changes in the nucleotide sequence of the variant may or may not alter the amino acid 
sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide changes 
may result in amino acid substitutions, additions, deletions, fusions and truncations in 
the polypq)tide encoded by the reference sequence, as discussed below. A typical 
variant of a polypeptide differs in amino acid sequence from another, reference 
polypeptide. Gen^:ally, differences are limited so that the sequences of the reference 
polypeptide and flie variant are closely similar ov^U and, in many regions, identical. 
A variant and reference polypeptide may differ in amino acid sequence by one or more 
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substitutions, additions, deletions in any combination. A substituted or inserted amino 
acid residue may or may not be one encoded by the genetic code. A variant of a 
polynucleotide or polypeptide may be a naturally occurring such as an allelic variant, or 
it may be a variant that is not known to occur naturally. Non-naturally occurring 
variants of polynucleotides and polypeptides maybe made by mutagenesis techniques or 
by direct synthesis. 

"Disease(s)" means any disease caused by or related to infection by a bacteria, including, 
for example, otitis media in infants and children, pneumonia in elderlies, sinusitis, 
nosocomial infections and invasive diseases, chronic otitis media with hearing loss, fluid 
accimiulation in the middle ear, auditive nerve damage, delayed speech learning, infection 
of flie upper respiratory tract and inflammation of the middle ear. 
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EXAMPLES: 

The examples below are carried out using standard techniques, wtdch are well 
known and routine to those of skill in the art, except where otherwise described in 
detail. The examples are illustrative, but do not limit the invention. 

Examplel: Cloning of the BASB232 genes S. pertussis strain Tnhama T. 

Genomic DNA is extracted from the B. pertussis strain Tohama I from 10'° bactaial 
cells using the QIAGEN genomic DNA extraction kit (Qiagen Gmbh). This material 
(lug) is then submitted to Polymerase Chain Reaction DNA amplification using two 
specific primers. A DNA fragment is obtained, digested by the suitable restriction 
endonucleases and inserted into frie con5>atible sites of the pET cloning/expression 
vector (Novagen) using standard molecular biology techniques (Molecular Cloning, a 
Laboratory Manual, Second Edition, Eds: Sambrook, Fritsch & Maniatis, Cold Spring 
Harbor press 1989). Recombinant pET-BASB232 is then submitted to DNA sequencing 
using the Big Dyes kit (Applied biosystems) and analyzed on a ABI 373/A DNA 
sequencer in the conditions described by the siqjpliCT. 

Example 2; Express ion and purification of recombinant BASB232 proteins in 
Escherichia colL 

The construction of the pET-BASB232 cloning/expression vector is described in Exanq)le 
1. This vector harbours the BASB232 gene isolated from Bordetella pertussis stram 
Tohama I in fiision with a sti»tch of 6 Histidine residues, placed under the control of the 
strong bacteriophage T7 gene 10 promoter. For expression study, this vector is mtroduced 
mto the Escherichia coli strain Novablue (DE3) (Novagen), in which, the gene for the T7 
polymerase is placed under the control of the isopnopyl-beta-D thiogalactoside (IPTG)- 
regulatable lac promoter. Liquid cultures (100 ml) of the Novablue (DE3) [pET- 
BASB232] E. coli recombuiant strain are grown at 37°C under agitation until the optical 
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density at 600mn (OD600) reached 0,6. At that time-point, IPTG is added at a final 
concentration of ImM and the culture is grown for 4 additional hours. The culture is then 
centrifiiged at 10,000 ipm and the pellet is frozen at -20°C for at least 10 hours. After 
thawing, the pellet is resuspended during 30 min at 25®C in buffer A (6M guanidine 
hydrochloride, O.IM NaH2P04, O.OIM Tris, pH 8.0), passed three-times through a needle 
and clarified by centrifugation (2000Qrpm, 15 min). The sample is then loaded at a flow- 
rate of Iml/min on a Ni2+ -loaded Hitrap colimm (Pharmacia Biotech). After passsage of 
the flowthrough, the column is washed succesively with 40ml of buffer B (8M Urea, 
0.1MNaH2PO4, O.OIM Tris, pH 8,0), 40ml of buffer C (8M Urea, 0.1MNaH2PO4, 
0.0 IM Tris, pH 6.3). The recombinant protein BASB232/His6 is then eluted jfrom the 
column with 30ml of buffer D (8M Urea, 0.1MNaH2PO4, O.OIM Tris, pH 6.3) containing 
500mM of imidazole and 3ml-size fractions are collected. Highly enriched 
BASB232/His6 protein can be eluted from the colimm. This polypq;>tide is detected by a 
mouse monoclonal antibody raised against the 5-histidine motif. Moreover, the denatured, 
recombinant BASB232-ifis6 protein is solubilized in a solution devoid of urea. For this 
purpose, denatured BASB232-His6 contained in 8M urea is extensively dialyzed (2 hours) 
against buffer R (NaCl 150mM, lOmM NaH2P04, Arginine 0.5M pH6.8) containing 
successively 6M, 4M, 2M and no urea. Alternatively, this polypqjtide is purified under 
non-denaturing conditions using protocoles described in the Quiexpresssionist booklet 
(Qiagen Gmbh). 

Example 3: Production of Antisera to Recombinants BASB232 

Polyvalent antisera directed against the BASB232 protein are graerated by vaccinating 
rabbits with the purified recombinant BASB232 protein. Polyvalent antisem directed 
against the BASB232 protein are also generated by vaccinating mice with the purified 
recombinant B ASB232 protein- Animals are bled prior to tiie first inomunization ("pre- 
bleed") and after the last immunization. 
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Anti-BASB232 protein titers are measured by an EUSA using purified recombinant 
BASB232 protein as the coating antigen. The titer is defined as mid-point titers 
calculated by 4-parameter logistic model using the XL Fit software.The antisera are also 
used as the first antibody to identify the protein in a westem blot as described in 
example 5 below. 

Example 4t Tmmimnlogical cha racterization; Surface exposure of BASB232 
Anti-BASB232 proteins titres are determined by an ELISA using formalin-kiUed whole 
cells of Bordetella pertussis (B. pertussis). The titer is defined as mid-point titers 
calculated by 4-parameter logistic model using the XL Fit software. 

Example 5. Immo nological Characterisation; Western Blot Analysis 

Several strains of B.pertussiSy as well as cliaical isolates, are grown on Bordet Gengou 
agar plates for 24 hours at 36°C and 5% CO2- Several colonies are used to inoculate 
Tryptic Soy Agar (TSA) broth supplemented by NAD and hemin, each at 10 p,g/ml. 
Cultures are grown until the absoibance at 62Qnm is approximately 0.4 and cells are 
collected by cmtrifiagation. Cells are then concentrated and solubilized in PAGE 
sample buffer. The solubilized cells are then resolved on 4-20% polyacrylamide gels 
and the separated proteins are electrophoretically transferred to PVDF membranes. The 
P VDF membranes are then pretreated with saturation buffer. All subsequent 
incubations are carried out using this pretreatment buffer. 

PVDF membranes are incubated with preinmnme serum and rabbit or mouse immune 
serum. PVDF membranes are then washed. 

PVDF membranes are incubated with biotin-labeled sheep anti-rabbit or mouse Ig. 
PVDF membranes are then washed 3 times with wash buffer, and incubated with 
streptavidin-peroxydase. PVDF monbranes are then washed 3 times with wash buffer 
and developed with 4-chloro-l-naphtol. 
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Example 6: Pre58eiice of Antibody to BASB232 In Hainan Convalescent Sera 



Western blot analysis of purified recombinant BASB232 is performed as described in 
Example 5 above, except that a pool of human sera firom children infected by 
B.pertussis is used as the first antibody preparation. 

Example 7; Efflcacv o f BASB232 vaccine: enhancement of Inng clearance nf 
B.vertussis in mice. 

This mouse model is based on the analysis of the lung invasion by B.pertussis following 
a standard intranasal challenge to vaccinated mice. 

Groins of mice are immunized with BASB232 vaccine. After the booster, the mice are 
challenged by instillation of bact^al suspension into the nostril under anaesthesia. 
Mice are killed between 30 minutes as well as 2, 5 and 8 days after challenge and the 
lungs are removed aseptically and homogenized individually. The loglO weighted mean 
number of CFUAung is determined by countmg the colonies grown on agar plates after 
plating of dilutions of the homogenate. The arithmetic mean of the loglO weighted 
mean number of CFUAung and the standard deviations are calculated for each groiq). 
Results are analysed statistically. 

Jn this experiment ffoups of mice are immunized either with BASB232 or with a killed 
whole cells (kwc) preparation of B.pertussis or sham immunized. 

Example 8; Useful Epitopes 

The B-cell epitopes of a protein are mainly localized at its surface. To predict B-cell 
epitopes of BASB232 polypeptides two methods were combined: 2D-structure 
prediction and antigenic index prediction. The 2D-structure prediction was made using 
the PSEPRED program (firom David Jones, Brunei Bioinformatics Group, Dept. Biological 
Science, Brunei University, Uxbridge UBS 3PH, UK). The antigenic index was 
caloulafe^ on the basis of the method described by Jameson and Wolf (CABIOS 4:181- 
186 [1988]). The parameters used in this program are the antigenic index and the 
minimal length for an antigenic pq)tide. An antigenic index of 0.9 for a mm\mim of 5 
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consecutive amino acids was used as threshold in fhe program. Peptides comprising 
good, potential B-cell epitopes are listed in table 5. These can be useful (preferably 
conjugated or recombinantly joined to a larger protein) in a vaccine composition for the 
prevention of Bordetella infections, as could similar peptides comprising conservative 
mutations (preferably 70, 80, 95, 99 or 100% identical to fhe sequences of table 5) or 
truncates comprising 5 or more (e.g. 6, 7, 8, 9, 10, 1 1 or 12) amino acids therefrom or 
extensions comprising e. g. 1, 2, 3, 5, 10 further amino acids at either or both ends from 
the native context of BASB232 polypeptides which preserve an effective epitope which 
can elicit an immune response in a host against the BASB232 polypq)tides. 

Table 5: Potential B-cell epitopes from S£Q ID NO:34 



Position 


Sequence 


56 


QDAGQEGEF 


84 


DDDPDELGE 


106 


EHKNPMS 


236 


PGFPPPPPPLP 


265 


GQDGK 


339 


DGANT 


381 


TIiRQTRI 


404 


PQSGSG 


538 


DGNKPL 


579 


ADSRVQD 


599 


APEASY 


628 


QNDQIj 


636 


GRADGQ 


653 


ADSRGA 


692 


AEDPKT 


753 


TFSERQQISNRH 


766 


RAYDQT 


785 


ASGGRW 


800 


YADRTYPGDGGG 


839 


GRYDQQY 


858 


DYRTSG 


869 


BGGRRF 


893 


TSGKRYRASN 


944 


QBFKSTGDVRTNG 


962 


AGRHGR 


1004 


AGYRYSF 



The T-helper cell epitopes are peptides bound to HLA class n molecules and recognized 
by T-helper cells. The prediction of useful T-helper cell epitopes of BASB232 
polypeptides was based on the TEPITOPE method describe by Stumiolo at al, (Nature 
Biotech. 17: 555-561 [1999]). Peptides con^rising good, potential T-cell epitopes are 
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listed in table 6. These can be useful (preferably conjugated to peptides, polypeptides or 
polysaccharides) for vaccine purposes, as could similar peptides coniprising 
conservative mutations (preferably 70, 80, 95, 99 or 100% identical to the sequences 
below) or truncates comprising 5 or more (e.g. 6, 7, 8, 9, 10, 1 1, 12, 14, 16, 18 or 20) 
amino acids therefrom or extensions comprising e. g. 1, 2, 3, 5, 10 further amino acids 
at either or both ends Scorn the native context of BASB232 polypeptides which preserve 
an effective T-helper epitope from BASB232 polypeptides. 
Table 6: Potential T-helper cell epitopes from SEQ ID NO:34 



Position 


Sequence 


20 


WRLHAIAAAUV 


34 


MARXiAPAAA 


105 


VBHKNPMSK 


118 


VRVSGAGRA 


144 


WRRGGTLEIiDGVTVA 


164 


MEPMTVSDA 


192 


IjVRAAQGGQ 


208 


IjQSIIiGPALIADGGSIS 


270 


VTLREVAIjRAHGPQAT 


287 


VYAYMPGSEI 


298 


LQGGTVSVQ 


329 


VRLDGTTVS 


347 


LVRGDAARAEWNTVIjRTAKSIiAA 


380 


VTIiRQTRIE 


420 


ITTTGNRAA 


444 


VRAEGSGSS 


461 


LWSAGSIiAS 


483 


LKLMPGAIjASS 


497 


VRLTDGATA 


513 


IiQQHSTIPV 


535 


IVADGNKPL 


563 


VLQSATIiGK 


576 


WNADSRVQD 


586 


MSMRGGRVEFQAPAPE 


609 


LQTIiDGNGVPVLNTNVAA 


644 


VLVRNAGGEA 


660 


LGLVHTQGQ 


673 


FRLANVGKAVD 


687 


WRYSIiAEDP 


699 


VWSIiQRAGQAIiS 


725 


lAIiAESNAL 


740 


LRLRADAGGPWARTFSERQ 


8^4 


YVGDGGYYLDTVIjRLGRYDQ 


845 


YNIAGTDGG 


S59 


YRTSGAAWS 


888 


VMLWRTSGKRYRASNGLRV 


915 


IjGRXjGIjRPGRRIAIiAG 


934 


VQPyARLGW 


952 


VRTNGIGHA 


996 


INIPWSFHA 



100 



wo 2005/032584 



PCT/EP2004/011082 



Hie same analyses (B-cell epitopes prediction and T-helper cell epitopes prediction) 
could be done for each BASB232 polypeptide sequence comprised in SEQ Groiq> 2. 

Example 9; Expression of BrkA 

BrkA is a 1010 anodno acid protein. A pET30b egression vector containing amino 
acids 60-702 of BrkA, fiised at the C and N-terminal to a 6x-"His tag was used to 
express BrkA in E, coli. The bacteria were grown to an OD600 of approximately 0,6 
and induced with ImM IPTG for 2 hours. Recombinant BrkA was purified under 
denaturing conditions using the protocol in the Xpress System Protein Purification 
manual (Invitrogen, Carlsbad, CA). The bacteria were lysed in 6M guanidine 
hydrochloride and the lysate was applied to to Ni2+ -nitrilotriacetic acid agarose 
(Qiagen, Mississauga, Ont). After successive washes in 8M urea of decreasing pH, 
purified BrkA was eluted at pH 4 and the fractions were pooled. The urea was 
removed by slow dialysis at 4 ""C against lOmM Tris, pH 8.0 in the presence of 0.1% 
Triton X-100. 

The purified protein was refolded so that it has a beta-structure resembling PRN 
(monitored by CD) and was functional in a serum assay. The protein was filter sterilized 
and is diluted in lOmM Tris buffer, pH 8 at a concentration of approximately 0.4mg/ml. 

Example 10; Formulation of vaccines 

Six vaccines were formxilated: 

L DTPa018A2 - contains 6.25Lf DT, 2,5Lf TT, 6.25ug PT and 6.25ug FHAper 
mouse dose. All antigens were adsorbed separately onto Al(OH)3 before 
combining. 

2. DTPal4885B9 - contains 6.25Lf DT, 2.5Lf TT, 6,25ug PT, 6.25ug FHA and 2ug 
pertactin per mouse dose. . All antigens ware adsorbed separately onto Al(OII)3 
before combining. 
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3. DTPW13126A9 - contains 6.25Lf DT, 2.5Lf TT and 1 LU.whole ceU B. pertussis 
per mouse dose. All antigens were adsorbed separately onto Al(OH)3 before 
combining. 

4. DTBrkA - contains 6.25Lf DT, 2.5Lf TT and 2,5ug BrkA per mouse dose. All 
antigens were adsorbed separately onto Al(OII)3 before combining. 

5. pTPa-2 BrkA - contains 6.25Lf DT, 2.5Lf TT, 6.25ug PT, 6.25ug FHA and 2.5ug 
BrkA per mouse dose. . All antigens were adsorbed separately onto Al(OH)3 
before combining. 

6. DTPa-3 BrkA - contains 6.25Lf DT, 2.5Lf TT, 6.25ug PT, 6.25ug FHA, 2ug 
pertactin and 2.5ug BrkA per mouse dose. All antigens were adsorbed separately 
onto Al(OH)3 before combining. 

Example 11; Protection against lung mvasion by B. pertussis in an g nlmal mnrirf 

Groups of 20 BALB/c mice (females, S weeks old) were immunized subcutaneously 
with Va of a human dose (125 \i\ of vaccine) and were boosted 3 weeks later. One week 
after &e booster, a sample of blood was collected firom each mouse for antibody 
determination. The mice were then challenged by instillation of SO |il of bacterial 
suspension (+/- 5 10^ CFU/50 ^il) into the left nostril imder ether anestiiesia. Five mice 
in each group were killed at 4 different times (2 hours, 2, 5 and 8 days) after challenge 
and the lungs were removed aseptically and homogenized individually. The loglO 
weighted mean number of CFUAung was determined by counting the colonies grown 
on Bordet-Gengou agar plates after plating of 100 nl of 4 serial dilutions of the 
homogenate. The arithmetic mean of the loglO weighted mean number of CFU/lung 
and the standard deviation were calculated for each groiqp and each time point 
Three experiments were performed. The same vaccine groiq)s were included in each 
experiment but animals were either challenged with B, pertussis strain Tohama, B. 
pertussis strain 18323 or B, parapertussis. 
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The day before challenge, blood was collected from each mouse. The anti-PT, and- 
FHA, anti-PRN and anti-BrkA antibody levels were determined by PJJSA, The 
geometFc mean titre for each group of mice was calculated. 

Example 12; B. pertussis strain Tehama challenge 

Jn one eisperiment immunized mice were challenged with B. pertussis strain Tohama. 
The niunber of CFU/lung at each timepoint after challenge and for each group is 
summarized in the figure 1 . The eKperiment had low variability with the data having a 
mean square error of is 0.450. 

Statistical analysis of the data using ANOVAl was used to assess the data. No 
significant diffemece was seen beween the protection against B. pertussis offered by 
DTBrkA conpared to control^ indicating that immunisation with BrkA alone is 
insufiScient to elicit protection. In contrast, the addition of BrkA to a DTPa-2 vaccine 
produced a statistically significant increase in protection showing that, in combination 
with FT and FHA, BrkA can produce additional protection. The level of protection 
conferred by DTPa-2 BrkA was statistically slightly less than that conferred by DTPa- 
3 which conferred protection statistically equival^t to DTPw. The DTPa-3 BrkA 
vaccme provided excellent protection from challenge after 2 and 5 days but less 
protection after day 8. 

Example 13: B. pertussis strain 18323 challenge 

In~tiiis experiment, immunized mice were challenged with B. pertussis strain 18323. 
The number of CFU/lung at each timepoint after challenge and for each group is 
summarized in figure 2. The exp^iment showed low variability with the mean square 
error of the experiment being 0.402. Statistical analysis using ANOVAl showed that 
again DTBrkA did not provide an significant protection over the control. However, 
DTPa3 BrkA provided better protection than DTPa, showing that BrkA, in 
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ccmibination with other B. pertussis antigens gives additional protection. Hie 
protection achieved by vaccination with DTPa-2 BrkA|, against challenge with B. 
pertussis strain 18323, was statistically equivalent to that provided by DTPa-3, DTPa- 
SBrkAandDTPw. 

Example 14; Comparison of protection against B. pertussis and B. panwertussis 
in mice vaccinated with DTPw or DTPa 

Groiq^s of 25 or 30 BALB/c noice (females, 5 weeks old) were immunized 
subcutaneously with % of a human dose of DT, DTPa or DTPw j&om different sources 
(125 \x\ of vaccine) and were boosted 3 weeks later. The sources of DTPw were Triple 
antigen (CSL), Tri-immune (Lederle), Pentacoq (MSD), Combivax (Behring), lofanrix 
(SB), DKTP (RVM), DTPw (Connaght) and Trivax (Wellcome). The sources of DTPa 
were Infanrix (SB), Triacel (PMCS), DI-TE-KIK (Amvax), Acell-immune (Lederle), 
Tropedia (Biken), Tricelluvax (Biocine/Chiron), Pentavac (PM-MSD) andDTPa-2 
(SB). One or two weeks after the booster, a sample of blood was collected from each 
mouse for antibody determination. The noice were then challenged by instillation of SO 
fil of bacterial suspension (+/- 5 10^ CFU/50 (il) into the left nostril under anaesthesia. 
Five mice in each group were Idlled at 5 or 6 different tunes (ranging from 2 hours to 
14 days) after challenge and the lungs were removed aseptically and homogenized 
individually. The loglO weighted mean number of CFU/lung was determined by 
coxmting the colonies grown on Bordet-Gengou ^ar plates after plating of 100 nl of 4 
serial dilutions of the homogenate. The arithmetic mean of the loglO wei^ted mean 
number of CFU/lung and the standard deviation were calculated for each group and 
each time point. 

The day before challenge, blood was collected from each mouse for determination of 
the anti-PT, anti-FHA and anti-PRN antibody levels by ELIS A. The geomean titre for 
each group of mice was calculated. 

Results were analysed statistically by applying 1-way and 2-way ANOVA after 
assuming equality of variance (checked by Brown and Forsythe's test) aiKi normality 
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(checked using tiie Shapiro-Wilk test). Differences between groups were analysed 
using the Dunnet test, Tukey's studendsed range test (HSD) and Student-Newman- 
Keuls test. 

Results are shown in figure 3, and show that the DTPw vaccines induced good lung 
clearance of B. pertussis and B, parapertussis. However, DTPa vaccines induced 
strong lung clearance ofB. pertussis but only limited clearance ofB, parapertussis. 

This experiment demonstrates that antigens presoit in the DTPw vaccine and absent 
firom the DTPa vaccine are protective against parapertussis. Moreover, the antigens 
present in the DTPa vaccine, specifically FHA and pertactin, may display antigenic 
variability when compared to the corresponding parapertussis antigens. 

Example 15: Protection against B. parapertussis bv immuiiization with DTPw 

Protection against a B, parapertussis challenge in the mouse lung clearance model can 
be obtained by active immunization with a DTPw vaccine (composed of J5. pertussis 
killed whole cells). 

In this experiment mice were unmunnized twice (day 0 and 21) with 1/4 human dose of 
DTPw vaccine and were intranasally challenged with B, parapertussis on day 28. 
Protection was measured by counting B. parapertussis in the lungs 0 (2 hours), 2, 5 and 
8 days after challenge. 

The results sho^vn in Figure 1 5 A demonstrate that immimization with DTPw resulted in 
a large decrease in the number of B. parapertussis present in the lung, compared to 
control mice. 

As shown in Figure ISB, protection against B. parapertussis can also be obtained after 
passive immunization using an anti-DTPw serum. In this case, 500 |iL of anti-Pw serum 
cr'nomial mice scrum was injected intraperitoneally to mice 20 hours before iatranasal 
challenge with B. parapertussis. Bacteria remaining in the lungs were assessed 0, 1, 2, S 
and 8 da>^ after challenge. 
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As shown in Figure 15B, passive immunization with antisera against DTPw resulted in 
a sizeable reducetion in the number of B, parapertussis in the lungs of mice compared 
to the control groiq>. 

Protection observed against B, parapertussis after active and also passive immunization 
with a DTPw vaccine containing B. pertussis killed whole cells suggests that cross- 
protective epitopes exist between the two strains. 

Example 16; Identiflcation of cross-protective Bordetella antigens 
Materials and Methods 

Outer membrane protein samples 

B. parapertussis outer membrane proteins (OMPs) samples were isolated from 
ATCC15237, using the protocol described in Inf. hmn. 1998, 66(8), 3597. This 
provided B. parapertussis OMPs at around 64 |xg/ml. 

Gel electrophoresis (SDS-PAGE) 

Four aliquots of (ecu 13 fig each) of OMPs sample were pooled and evaporated to 
dryness under vacuum. The residue was solubilised in 100 p.1 of 0.1% w/v n- 
octylglucoside in water. The sample was reduced by the addition of 10 \x\ of 200 mM 
tributylphosphme and all<ylated with 5 ^il of 2-vinjdpyridine. The reaction was 
performed at 4®C for 4 hours. The mixture was then precipitated and washed with the 
Amersham SDS-PAGE clean-up kit. The complete sample (36 \xi) was then dissolved 
with the SDS-PAGE sample buffer (36 jil) and reductant (7 [xl) provided with the Bio- 
Rad Criterion XT gels. 64 jil of water were added and flie mixture was heated at 95*^0 
for 5 min. 

Two gels wdth lanes loaded with 14|il (ca. 5fig) of sample each, were run using a MES 
(2-morpholinefhanesulfonic acid) running buffer to provide resolution for the small size 
range. Two other gels, also with lanes loaded with 14 [il of sample each, were run using 
MOPS (3-[N-moipholino] propanesulfonic acid) for resolution of the heaviest proteins. 
For each gel, on each side of the sample lanes, prestained and unstained standards 
(BioLabs cat no. P7708S and Bio-Rad cat no. 161-0317) were co-electrofocused as gel 
and Western alignment guides, respectively. 

Gels were stained with the E^lue Coomassie G250 colloidal reagent (Sigma). 
Western blotting 

hnmediately following gel electrophoresis, for each running buffer used, one gel was 
electroblotted onto a nitrocellulose membrane (Hybond ECL, Amersham), using a 
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transfer biiffer of 25 mM Tris, 192 mM glycine, 0.01% w/v SDS and 20% v/v methanol 
at A^'C with a voltage of 100 V for 1 to 1.5 hour. Transfer was accomplished using the 
Trans-Blot Cell wet system from Bio-Rad 

Electroblotted proteins were revealed by pool of mouse sera raised against B, pertussis 
(aDTPw), produced as described in Exaxnple 15, followed as secondary antibody by 
alkaline phosphatase-conjugate goat anti-mouse Ig<j/IgM, Western inmumoblots were 
visualised using AP chromogen kit nitro-blue tetrazolium (NBT)/ 5-bromo-4-chloro-3- 
indoyl phosphate potassium salt (BCIP) Scorn Promega. 

Gel and Western blot matching 

Immunoactive bands on the Western blot membrane were matched to the corresponding 
Coomassie stained gel band by alignment after re-sizuig of the molecular weight 
markers presmit on the scanned images. 

In-gel digestion and peptide extraction 

The bands were excised and cut into pieces (ca. 1 mm*^). They were then transferred to 
500 111 Protein Low Bind eppendorf tube. The gel pieces were washed three times with 
300 III of 50% v/v methanol and 5% v/v acetic acid in water for 15 min, followed by 
300 III of 100 mM ammonium bicarbonate for 5 min. Gel pieces were shrunken in 300 
[il of acetonitrile for 5 min and dried imder vacuum in a Speedvac evaporator for 5 min. 
The dried gel pieces were re-swollen in 100 jil of 5mM tributylphosphine, 100 mM 2- 
vinylpyridine and 100 mM ammonium bicarbonate at 4 °C for 1 hour. The supernatant 
was discarded 

The gel pieces were washed with 300 ^1 of 100 mM ammonium bicarbonate for 5 min 
and shrunken in 300 (xl of acetonitrile for 5 min. That step was repeated two times. The 
gel pieces were dried imder vacuum in a Speed vac evaporator for 5 min. 
In-gel digestion was performed overnight by the addition of SO |xl of a solution of native 
Bovin trypsin (100 roM ammoniimi bicarbonate, 5 mM CaCl2, 0.01% w/v n-octyl 
glucoside). The proteol3/tic peptide products were extracted by sonication in 5% formic 
acid and acetonitrile. The combined extracts were then reduced to approximately 20 jil 
by vacuum evaporation. 

Mass spectrometry 

Band digests were analj^ed by nanoLC-MS/MS using a Micromass QTof-2 tandem mass 
spectrometer (Micromass) operated by the Masslynx data system (Micromass). The Q-Tof 
was operated in survey-scan mode using a 1-s MS survey scan, from 400 to 1500 Da, 
followed by 1-s MS/MS scans on eigjit different precursor ions. Scorn 50 to 2000 Da. 
MS/MS spectrum acquisition was allowed for to a total of 4s on each precursor ion or 
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Stopped when the signal intensity fell below 3 counts/second. Precursors were excluded 
from any further MS/MS experiment for 200 sec. Only doubly and triply charged ions 
were selected as precursors for MS/MS. The cone voltage was of 35 V. The collision 
energy was adjusted according to the charge state and the mass of the precursor ion. The 
mass spectrometer was fitted with a nanospray SilicaTip eimtter (New Objective). It was 
tuned to reach a resolution (fwhm) between 6000 and 8000 at m/z 432.9 (angiotensin). 
The capillary voltage was set between 2000 and 2700 V. The source temperature was 100 

The HPLC system was a CapLC (Micromass) using a ten port-valve enabling on-line 
desalting. The trapping cartridge was a CI 8 Symmetry (5 x 0,3 mm - 5 jim - 300 A) 
(Waters) operated at 10 jil/min with 1% v/v methanol and 0.1% TFA v/v m water. After 
8 min the tra-port valve was switched. The analytical colunms were Atlantis CI 8 (150 x 
0.075 mm - 3 ^m - 100 A) (Waters). They were operated at 250 nl/min at room 
temperature. An AB gradient was run from 5 to 40% B in 30 min. The eluent A was 
0.1% v/v formic acid and 2% v/v acetonitrile in water. The eluent B was 0.1% v/v 
formic acid and 5% v/v water in acetonitrile. The equilibriimi time after gradient elution 
was of 20 min. 

Protein sequence database search 

Peak masses extracted from electrospray MS/MS spectra using the PeptideAuto macro 
were used for protein identification using Mascot 1.9 software (Matrix Science). The 
hiterrogation was performed against the protein sequence database of Bordetella 
parapertussis strain 12822 (4185 sequence entries) downloaded from the Sanger 
Ihstitue web site (http://www.sanger.ac.uk). The mass tolerances allowed on the 
precursor and fragment ions were of 0.25 Da. Only proteins for which three trypsic 
peptides were identified were taken into account. 

RESULTS 

The alignment of the Western blots and the corresponding Coomassie-stained 
gels for the two running buffer systems used are displayed in Figures 5 and 6. B. 
parapertussis OMP gel bands detected by Western blotting using sera raised against B. 
pertussis (DTPw) were excised from the gel. Trypsin was added and peptides were 
eluted out of the gel bands as described in the Materials and Methods section. Trypsic 
peptides of each bands were chromatographied by nanoLC and sequenced by MS/MS. 
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MS/MS sequence data were used to search against the B. parapertussis protein sequeace 
database. Hits were manually evaluated for their scores, ion intensity and number of 
matched trypsic peptides. Gel bands identification are simmiarised in Table 7. 

For 6 MOPS gel bands and their MES equivalents , four membrane proteins 
were identified: 

i) bands #8 (MOPS) and #14 (MES): autotransporter (BPP0452 gene); 

ii) bands #9 (MOPS) and #1 5 (MES): TonB-dependent receptor for iron transport 
(BPP3376 or BFRD gene); 

iii) bands #12 (MOPS): outer membrane porin protein (BPP3392 gene); 

iv) bands #13 (MOPS) and #18 (MES): outer membrane protein A (BPP3135 or 

OMPA gene). 

The identification of the autotransporter protein (BPP0452 - Theor. MW =177 
kDa) in gel bands at approximately 60 kDa could be tentatively assigned to a proteolytic 
fragment or to other autotransporter proteins displaying a high level of homology. This 
protein is the B. parapertussis homologue of SEQ ID No: 50, sharing 94% identity. 

The TonB-dependent rec^tor for iron transport (BPP3376) is the B. 
parapertussis homologue of SEQ ID No: 14, sharing 98% identity. 

A cytoplasmic chaperonin protein was idmtified for the gel bands #10 (MPOS) 
and #16 (MES) that could be associated to a contamination of the outer membrane 
fraction during the sanq)le processing. 

CONCLUSION 

The method identified several antigens which generate an immune response which is 
cxFossareactive between B. pertussis and B. parapertussis. The proteins encloded by SEQ 
ID NO: 50, 100, 14, 106, 108, 110, 102 and 104 are such proteins. The ideaitified 
proteins isolated from either B. pertussis or B, parapertussis could be usefully 
incoiporated into an acellular vaccine to provide protection against both B. pertussis and 
5. parapertussis disease. 
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Gel band 


Hit # (Score*) 


Protein description (Tlieor. MW in Da) 


Nb peptides 


Gene 


(running buffer) 






matched 






7 (821) 


autotrmsporter (1 76978) 


15 


BPF0452 


#8 (MOPS) 












2(55) 


translation initiation &ctor (103481) 


3 


BPP1862 


#14(MES) 




autotransporter (1 76978) 


16 


BPP0452 




2(112) 


outer membrane protein A (20985) 


3 


BPP3135 


#9 (MOPS) 


1 (229) 


TonB'dependent receptor for iron 


7 


BPP3376or 






transport (81321) 




BFBD 




2 (174) 


polyribonucleotide nucleotidyltransferase 


4 


BPP3431 






(77294) 








3 (123) 


probable surface antigen (86290) 


4 


BPP1535 


#15(M£S) 


1(314) 


TonB-dependefit receptor for iron 


8 


BPP3376or 






transport (81 321) 




BFRD 




2(229) 


polyribonucleotide nucleotidyltransferase 


7 


BPP3431 






(77294) 






#10 (MOPS) 




chaperonin (57447) 


27 


BPP0868 


#16 (MES) 


1 (621) 


chaperonin (57447) 


16 


BPP0868 


#11 (MOPS) 


1 (446) 


autotransporter (1 76978) 


10 


BPP04S2 




2 (288) 


elongation fector (42889) 


6 


BPP0007 


#17 (MES) 


1 (388) 


autotransporter (1 76978) 


8 


BPP04S2 




2(271) 


elongation factor (42889) 


6 


BPP0007 




3(112) 


putative membrane protein (47410) 


3 


BPP2847 




4(98) 


enolase (45885) 


3 


BPP3252 


#12 (MOPS) 


1 (528) 


outer membrane porin protein (41319) 


18 


BPP3392 


#13 (MOPS) 


1 (209) 


outer membrane protein A (20985) 


6 


BPP3135or 










OMPA 




2(151) 


outer membrane porin protein (41319) 


5 


BPP3392 



#18 (MES) 1(196) outer membrane proteui A (20985) 6 BPP3135 or 

OMPA 

Score" - The MOWSE score algorithm is descaibed in DJC P^pin et al Cuix. Biol., 
3(6); 327-32 (1993). 

Table 7 



110 



wo 2005/032584 



PCT/EP2004/011082 



Deposited materials 

A deposit of strain 3 (strain 3224A) has been deposited with the American Type Culture 
Collection (ATCC) on May 5 2000 and assigned deposit number PTA-1816. 

The B. pertttssisjtrsdn deposit is referred to herein as "the deposited strain" or as "the DNA 
of the deposited strain." 

The deposited strain contains a fiili length BASB232 gene. 

The sequence of the polynucleotides contained in the dq>osit6d strain, as well as the amino 
acid sequence of any polypeptide encoded thereby, are controlling in the event of any 
conflict with any desoiption of sequences herem. 

The deposit of the deposited strain has been made under the terms of the Budapest Treaty on 
the hitemational Recognition of the Deposit of Micro-organisms for Purposes of Patent 
Procedure. The dq)osited strain will be irrevocably and withoxit restriction or condition 
released to the public upon the issuance of a patent The deposited strain is provided merely 
as convenience to those of skill in the art and is not an admission that a deposit is required 
for enablement, such as that required under 35 U.S.C. § 1 12. A license may be required to 
make, use or sell the deposited strain, and compounds derived therefrom, and no such 
license is hereby granted. 
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SEQUENCE INFORMATION 

BASB232 Polynucleotide and Polypeptide Sequences 

SEQ group 1 contains SEQ ID NOS: 33, 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 35, 27, 29, 31, 
35, 37, 39, 41, 43, 45, 47, 49, 51, S3, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 

87. 89. 91. 93. 95 and 97. 

SEQ group 2 contains SEQ ID NOS: 34, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 

88.90. 92.94.96 and 98. 

SEQ ID NO;l polynucleotide sequence of Orfl 

Atgtccaccccccgattcgcgctgcattacgccagcgcgtcagtcctgctggccgcatcc 
Ggcctggccatggcgcagacggccacccagatccacgatccgtcgcaggtgcagcagatg 
Gcgacggtgcaggtgctgggcacggccgaagaggaaatcaaggagtcgctgggcgtctcg 
gtcatcaccgccgaggagatcgcccgccgcccgcccaccaatgacctgtccgacctgatc 
cgccgcgaacccggcgtcaacctgaccggcaacagcgccagcggcgcgcggggcaacagc 
cgccaggtcgacatccgcggcatgggccccgagaacaccctcatcctgatcgacggcaag 
cccgtcacctcgcgcaatgcggtgcgctatggctggaacggcgaccgggacacgcgcggg 
gacaccaactgggtgcccgccgaggaagtcgagcgcatcgaagtgatccgcggcccggcc 
gccgcccgctacggttccggggccatgggcggcgtggtcaacatcatcaccaagcgcccc 
gccgatcgcgccaccggctccatcacctactacacgaaccagccggaagacagccgcgag 
ggcaacaccaaccgcgtcaatgcgcgcatcagcgcgccgatcagcgacacgctgagcatg 
cggctgtacggcaactacaacaagaccaatccggatgcccgcgacatcaacgccggccac 
gcgaacaccagcgacaacggcaacccctcgaccgccggacgcgagggcgtcatcaaccag 
gacctgagcgcgctgttctcgtggaaagccgacagccacaacaccgtggacctggacatg 
ggcttcagccggcagggcaacctgttcgccggcgacaccatgaacaacgccaacagcgac 
ttctcggacagcctgtacggcaaggaaaccaatgcgatgtaccgcgagaactatgcgctg 
acgcaccgcggcgtctacgactggggcacctcgcgcgccagcgtcggctatgactacacg 
cgcaacgcgcgccagcgcgaaggcctggccggcggccccgagggcgcgcccaccgcgggc 
ggctacgacaccgcgcgcctgaagaactggcgcgccgcggccgaggccagcgtgccgttc 
catctcggtttcgagcaggtcgccacggtcggcgtggaatggctgcgcgaatcgctggaa 
gaccccgccggcacgcgccagacctataccggcggcgccatcggcggcacggccccggcc 
gaccgcgacccgaaatcgcgccagaccagctatgcgctgttcgccgaggacaacatcgag 
at cgacgagcgcaccat get cacgcccggcgt gcgcc t ggaccacaacagcgaa 1 1 cggc 
agcaactggagtcccagcctgaacgcctcgtacgccgtcaccgacgcgctcaagctcaag 
ggtggcatcgcgcgcgcctacaaggcgcccaacctctaccaatccaaccccaactacctg 
ctgtacagccgcggcaatggctgcctggcctcgcagaccaacaccaacggctgctatctg 
gtcggcaacgaggacctctcgccggaaaccagcgtcaacaaggaaatcggcttcgagtac 
gacccgggcacgtggcgcaccagcatggcctatttccgcaacgactaccgcaacaagatc 
gtcgccggcaccgacgtccagtaccgcctggccaatggcgcccgggtgctgcaatggacc 
aacagcggcaaggccgtggtcgaagggctggaaggcaacctgtt cat tccgctggc cage 
aatctcgactggaacaccaacttcacctacatgafcccagtccaaggaaaaggctaccggc 
gaacccttgagcgtgattcccgaatacaccatcaacagcacgctggactggttctacacg 
ccgcagctgtcgttccaggccaatctcacctattacggcaagcaggaaggcGcgtccacc 
aatgtacgcaccggcgtcgaactgaacggcgacggccgccagaccatcagtccgtatgcG 
ctggcgggcctgagcatgggctacgaagtcaaccggaacctgaagttccgcgtcggcgtg 
agcaacctgttcgacaagcagctgtaccgcgaaggcaatgccagcagcgcgggcgcggcc 
acctacaacgaaccggggcgcgcctattacgccacggcgacggtgtcgttctga 
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SEQ ID NO:2 polypeptide sequence of Orfl 

MSTPRPALHYASASVLLPJ^GLAMAQTATQIHDPSQVQQMA 

VITAEEIARRPPTNDIiSDIiIRREPGVNIiTGNSASGARGNSRQVDIRGMGPEOTLIIilDGK 

PVTSRNAVRYGWNGDRDTRGDTNWVPAEEVERIEVIRGPAAARYGSGAMGGVVNIITKRP 

ADRATGSITYYTNQPEDSREGOTNRVNARISAPISDTLSMRIiYGNyNK^ 

AOTSDNGNPSTAGREGVINQDIiSALFSWKADSHNTVDLDMGFSRQGNLPAGDT^ 

FSDSLYGKETNAMyREmALTHRGVYDWGTSRASVGYDYTRNARQREGr^ 

GYDTARIiKmniAAAEASVPPHIiGFEQVATVGVEWLRBSLEDPAGTRQTYT^ 

DRDPKSRQTSYAIjFAEDNIEIDERTMLTPGVRLDHNSEFGSNWSPSIiNASYAVTDALK1.K 

GGIARAYKAPNLYQSNPNYLLYSRGNGCLASQTNTNGC^ 

DPGTWRTSMAYFRNDYRNKIVAGTDVQYRLANGARVLQWTNSGKAVVEGLEGNIjFIP 
NLDWOTTSrFTYMIQSKEKATGEPLSVIPEYTINSTLDWFYTPQI*SFQAlSn:iT^ 
NVRTGVEIiNGDGRQTI S P YALAGLSMGYEVNRNLKPRVGVSNLFDKQIiYREGNASSAGAA 
TYNEPGRAYYATATVSP 

SEQ ID NO:3 polynucleotide sequence of OrfZ 

Gtgttttctcgcagtcagaagcatccgtcctggcgcctgtcgccatgcgtacttgcggcc 
Gccttgtgcgccgtcgcggtcggtagcgcggacaccgcccgcgcgcaggcgcccgccgcc 
Agcgcccagcattatgaaatcgcggccggaccgctggccgacgcactgacccgcttcgcg 
cgccgtgccggcgtggtgctgtcgttcgacccggccctggtgcaggggcgcagcacggcg 
ggcctgcagggcgtgtacggcgtgcgcgacgggttcgcggcgctgctggccggctcgggc 
ctgcaggcgcgcgccggcggcggcaacaactggtcgctggcggcgctgccgcgcggcggc 
gatgcgcagacgctggcgccggtgacggtgctgggcctggagggcgcgctggcgcccacg 
gtcggctatgtcgccagtgccagcctgagcggcaccaagaccgatacgccgctgatcgaa 
acgccgcaat cga 1 1 1 cggt ggt gac t cgcgaccaga t aaccgagcagggcgcccagacg 
ctgaaccaggtgctgcgctataccgccggcgtggcgaccgagacgcgcggcgcgaccgcg 
acgcggctggaccagttcagcgtgcgcggtttctccgccgccacctatctggacggcatg 
cgcgtgttcggcggccgggacgccttgccccaggtcgacgcctaccggctcgaacgggtc 
gatgtgctcaaggggccggcttccgtgctgtatggccagggcggcccgggcggcgtggtc 
aaccaggtcagcaagcgtcccctggacgagcctttgcgcgagatcgaagtgcaggcgggc 
aatttcgatttccggcgggtcaacatggatttttccggcccggtggacgaggaccggcgc 
ttcctgtaccgggtaaccggcgcggcctatatgtccgatggccaggtggatcacaccagg 
gagcgccgctacttcgtctcgccgtcgttcacgtggcggcccagcgcggataccacgctg 
accgtgctgaccaacttccagcgcgaccccgacatgggttcgtacggatcgatctcggcc 
atgcgcacgctgctgtcggcgcccgacggcaggcggctgggcccgaaccactacgacggc 
gacgccgatttcgaaaagagcgaccgccgcagctattcgctgggctatcaactggagcat 
cgcttcaacgataccttcaaggcctcgcagaacctgcgtttccagcatgccgagggcgtc 
tatcgcagcatctacggcgccagcaacaacaattacggctatctcgacaaggactaccgc 
tactcgcagcgcggcctggccatcagcgacgtggacgtggatgcgttcacgatcgacaac 
aacctgcaggcgcgcttcgataccggggcgctggcgcatacggtgctggtggggttcgac 
taccagcgcgtgcagaccgacaccttgtcgggctatggcagcgcgccgccgctcgacgtg 
ttcgatccggactatcacatgggtatcgagcggccgccgtttacgtccgatcagacccag 
tacaactaccagaccggcctctacctgcaggaccagatcaggctggatcgcctgtcgttg 
ctgctgggcgggcgctacgactggtcgcgcacccacaccggcaccgacaacctggccaac 
ggctcgcacagcagctcggcgctcgccgccgaggcgttcaccggccgggtcggggcgatc 
tacaacttcgacaacggcgtggcgccgtacgccagctactcggagtcgttcgagccgcag 
accggcacgggctggaacaacacgccgttcaagccgaccgaaggcaagcagtatgaggtc 
ggcgtgaaataccagccgccgggctcggccacgctgctcacgctggcggccttcgacatc 
cggcgcaagaacctgcccaccaccgacccggatcccacccatatgtgcggcgtttcgcgc 
tgctcgatacaggccggcgaagtgcgcacccgcggcatcgaactggaggccaagaccgaa 
ccgctgcgcggcctgagcctgatcgccgcctattcgtacctcgacaacgaatacgagaag 
gcctatccgaacacgaccgggttggacctcaagggcaagaagccggtggccgtgccggcg 
caccaggcgtcggcctgggcccgctatcaactgcaggagggcccgctggccggcctgggc 
atgggcgcgggggtgcgctacatcggcagttcgtacgccaacgaaaccaacacgctcaag 
gtgccatcggtgacgctggtggacatgatgctcgactacgacctgggccgggccagcccc 
gcgctcaagggcatgcaggtggcgttgaacgtctccaacctgttcgacaaggaatacatc 
ggctcgtgcctgtccgattcgtggtgctggtatggctaccagcgttcgatcaaggccagc 
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ttgcgctatcgctggtga 



SEQ ID NO:4 polypeptide sequence of Orf2 

VFSRSQKHPSWRLSPC^niiTUVAIiCAVAVGSAOTAiy^QAPAASAQHYE 

RRAGVVXiSFDPALVQGRSTAGIiQGVYGVRDGFAJUjIAGSGLQAIUVGGGN^ 

DAQTLAPVTVLGLEGAIAPTVGYVASASriSGTKTDTPLI ETPQS I S WTRDQ ITEQGAQT 

LNQVIiRYTAGVATETRGATATRliDQFSWGFSAATYLDGMRVFGGRDALPQVDAy^ 

DVLKGPASVLYGQGGPGGVWQVSKIiPIiDEPIjREIEVQAGNFDFRRVNMD^ 

FLYRVTGAAYMSDGQVDHTRERRYPVSPSFTWRPSADTTLTVIiTNFQRDPDMGSyGSISA 

MRTLLSAPDGRRLGPNHYDGDADFEKSDRRSYSLGYQLEHRFNOTFK^ 

YRS I YGASNNNYGYLDKDYRYSQRGIiAI SDVDVDAFTIDNNLQARFDTGALAHTVLVGFD 

YQRVQTOTIiSGYGSAPPLDVFDPDYHMGIERPPFTSDQTQYNYQTGLYLQDQIRLDRLSIj 

MjGGRYDWSRTHTGTDNIiANGSHSSSAIiAAEAPTGRVGAIYNFDNGVAPYASYSE^ 

TGTGWNOTPFKPTEGKQYEVGVKYQPPGSATIiLTLAAFDIRRianJPl^ 

CSIQAGENmTRGIELEAKTEPIiRGIiSIilAAYSYLDNEYEKAYPNTTGLDLKGKK 

HQASAWARYQLQEGPLAGIX5MGAGVRYIGSSYANEmIaJKVPSVTIlVDM^^ 

ALKGMQVALNVSNIiFDKEYIGSCIiSDSWCWYGYQRSIKASIjRYRW 



SEQ ID NO:5 polynucleotide sequence of Or£3 

Atgcgagccagaccgagcgcggcaacccgcgccctgcactcatcgtcgcgcccccgccgc 
Ctgatccctgccctgctgggcgcgcttgcctgcctgggcactggcgtacaggccgcgccc 
Atcgacgtcgatatcccgccccagaacctggcccaagccctgcaccagcttggccggcaa 
gccaacctgcaggtcctgtacagccaggacctggtcgatggccagcgcagccccgccgtg 
cagggccgcatggaacctgccgaagcgctggaacgcctgctgaaaggccgcaacatccgc 
tattcgatccagcacaacaccgtcacgctcacgcccatgccgctgactgcgacgctgccg 
gcaatcagcgtggtcggcgccctgcctgactccgacacctacgtggccacaggcacgaca 
gccggcaccaagacggacacgccgctgatcgaaataccgcaatccatttcggtggtgacc 
gccgcgcagatccgcgagcagaatccgcagacgctgggcgacgccgtgcgctacacgccc 
ggtatcgtggtgcaggaaggattcaaccgcaccgacgatcctttcatcatccgcggcttc 
gacgtccgcaccaatcccggcgtcatgttccgcgacgggctgaaaatccccctgccccat 
tacagcgcgatgtccgaaccctacgcgctcgaacgcatcgaggtcgtgaaaggccccgct 
tcggtgctgtacggacaggcctcgccaggcggcatcgtcaacgtggtatccaagcggccg 
acagacagcccgctgcgcgagctgcagctgagcggcggctcgcacagcaacaggcagctc 
gccggcgacttcggcggacgcatcgacgacgagggacggctgacctaccgcctgaccggc 
ctggcgcgcaatgccgacacgatgatcgaccatgtcccggacgaccgctactatctcgcc 
cccgcgctgacctggcgcatcagcccggacacctcgctgacgctgctggcgagctacatg 
aagaacaagaccatcaacaacgccggctacccgctcgaaggcacggtcaagtacaacccc 
aacggccgcatcccgcgtcaccgcttcaccggcgagccggactggagcaagtgggaccag 
gaggtcgccaacgtggggtaccagtttgcgcaccgcttcaacgacacctggcaattcaag 
cagaacctgggctacgcccagtcgcgcaaccgcgtcaaccacgcctactggtggacctgg 
gtgcccggaagcgacttctccacggccgagcgcggcgcctaccgccgcgacgatgacgcc 
cacggcgtgagcatcgacaaccagttcgaggccacgtggcaatccggccgcttcaggcac 
aacacgctgttcggcctcgattacaccgaaacctcgttcacccgcaaacagtacgccggc 
tacaacaacctcgccccgatcgacttcttcgatccggcgtacggctcggacgtactgctg 
ccggcgaagccggacacctacaccaacgagaagcgcagccagctcggcctgtacttgcag 
gaccagataaagttcgacgacaaactggtggtggtgctcagcggccgctacgacaatgcc 
gacggctcgacgctgaacaagctgagcggcgtcaatacccgcaccggcgacaacgcgttc 
acgtggcgcaccggcctgctctaccttgcggacaacggtctggcgccctataccagctat 
tcgacttcgttccagccgcaggccggcaccacctcgcccgcacgcggcaccacgcccttc 
gacccgaccaagggcaagcagtgggaagccggcgtgaagtaccagcccaatggttcgaac 
tcgttcatcaccgcatccgtcttcgagctgacgcgcaccaacgtccccacgaccgacccc 
gccaaccccgtctacagcgtgcaggaaggcgaggtgcgctcgcgcggcctcgaattgtcg 
gccaccgccaacctggcctcgggctggaacctgatcgcggcctacacgtataccgacgcg 
gaaatcaccaagagcaactccaacacgctaggcaacacgcccgaggccgtgccgcgcaac 
atggcgtcgctatggtccgactacaccgtcccgtccggtgcgctggcggggctgaatatc 
ggcgccggcgt gcgc t acatgggc t cgacct acaacaacaccaat gccgccaaggt cggc 
gactacaccgtgttcgacgccgccctgcgctacgacttcggggcgcgcagcccgtccctg 



114 



wo 2005/032584 



PCT/EP2004/011082 



aaaggctggacggccgatctcaccgtgcgcaacctgttcgacaaggactacgtggcctcg 
tgcacctatgcctgcttctacggagaaggcaggaccgtgctgggccgggtcacgtacaaa 
tggtag 



SEQ ID NO:6 polypeptide sequence of Ori3 

MRARPSAATJRALHSSSRPRRLIPALIjGAIiACLGTGVQAAPIDVDIPPQNI^ 

ANIjQVIjYSQDLVDGQRS PAVQGRME PAEALERliLKGRNI R YS IQHNTVTLTPMPLTATIiP 

AISWGALPDSOTYVATGTTAGTKTIJrPLIEIPQSISVVTAAQIREQNPQTLGDAVRYTP 

GI WQEGFNRTDDPFI IRGFDVRTNPGVMFRDGLKI PLPHYSAMSEPYALERI EWKGPA 

SVIiYGQASPGGIVNWSKRPTDSPLREIiQLSGGSHSmQLAGDFGGRIDDEGRLTYI^ 

LARNADTMI DHVPODRYYIiAPALTWRI S PDTSLTLIjAS YMKNKTINNAGYPLEGTVKYNP 

NGRI PRHRFTGEPDWSKWDQEVANVGYQFAHRPMyiWQFKQNIiGYAQSRNRVNHAYWW 

VPGSDPSTAERGAYRRDDDAHGVSIDNQFEATWQSGRFRHNTLPGIiDYTETSFTRKQYAG 

YNNLAPIDPFDPAYGSDVIiLPAKPDTYTNEKRSQLGLYIjQIXJIKFDDKL 

DGSTLNKIiSGVinrRTGDNAFTWRTGIjIiYIJ^NGLAPYTSYSTSPQPQA^ 

DPTKGKQWEAGVKYQPNGSNSFITASVFEIiTRTNVPTTDPANPVYSVQEGEVRSRGIiEIiS 

ATANIiASGWNLIAAYTYTDAEITKSNSlSrrijGlSrrPEAVPRNMASLWSDl^^ 

GAGWYMGSTYNNTNAAKVGD YTVPDAALRYDFGARS PSLKGWTADLTVRNLFDKDYVAS 

CTYACFYGEGRTVLGRVTYKW 



SEQ ID NO:7 polynucleotide sequence of Orf4 

Atgacaggctttcatgcacgcaagccggtgggcggtggccatggccggcccgcgcacggc 
Aggccgcttgcgtggccgcgcatccccctacggacaaccaccatgaagccattaccgctc 
Gcttatctcgccgcgctgctgccctggtacgcaggcgtcatccaggcgcaatccgcgccc 
gccgccggcgacgatgcctcgatcaccctggaagccgtcagggtcgaggccagcgccgac 
gcctccgccggcggcctggcgccggccttcgcgggcggccaggtcgccacgggcgcgaag 
gtcggcatcctcggcacgcgcgacaacctggaaaccccgttctccatcaccgcctacacc 
aacgaactgatccaggaccgccaggccaagggggtgggcgacgtcctgcagaacgacccc 
ggcgtgcgggtggcgcgcgggttcggcaacttccaggagtcgtatttcatccgcggcttc 
atcctcagctcggacgacatcgcgtacaacggcctctatggcctgttgccgcgccagtac 
atctcgacccagctgttcgagcgcgtcgaggtgctgcgcggtgcctcggcgtttctcacc 
ggcgcgccgccgtccggcggcgggatcggcggggtgatcaacctggttcccaagcgcgcg 
cccaacgagccgctgacgcgcttttcggccggctacggcagcgacagcgtgctcgaggct 
tcggccgacatcggccggcgcttcggcccggacgacagcgtcgggatccgcatcaacgcc 
gcccagcgcggcggcgagaccgccatcgacggcgagcgcacccgcaccacggtgttcgcg 
ctgggcctggactggcgcggcgagcgcgcgcgcctgtcggccgatatcggctaccaggac 
aaccgcctgaagcgggcgcgccccaatgtcacgctggccggcgacgccgccaaggtgccc 
ggcgcgcccgacgccggctccaactatgcccagccctggtcgtactccaacgaacgcgac 
gtgttcggcaccctgcgcggcgaatacgacttcaacggccgcataacgggctgggtcgcc 
tatggcatgcgccagagcaaggaggagaactcgctggccaacctcaataacgtcaacggc 
STcggggcagggcaagttctaccgcttcgacaacgcccgcgaggataccgtcaacaccggc 
gagatcggcctgcgcgccaaggcgcgcaccggcccggtgggccacgaactggtcgcctcg 
gcgtcgtattfccgacctcgagaagaagaacgcctatgtcatggacttcttcaaccagttc 
gacaccagcatctacgaccccgtcagctacgccaagccggccatcagcagcaccgcgttt 
cgcggcaacgacatggacgatcccgccaagcagggcgtcatccggctggccagctatgcg 
ctSrSSfcgacaccatgtcgttcttcgacgacaaggtgctgctgaccgccggcatccgccac 
cagcgccrtctaccagcgcgactacagctacgacacgggcatcggcggcaccccctacgag 
caaagccacaiactcgcccgccgccggcctggtggtgcgcgtgacgccccaggtgtcgctg 
tacgccaactacatcgaggccctgtcggcgggcgacaccgcgccgcagaccgccaacggc 
ctgccggtggtcaaccacggcgaatcgctggcgccctatgtgtccaagcagaaggaagtg 
ggcgtcaagctcgagcacgacggcctgggcggcggcctggcgctcttttccaccgacaag 
ccgcgcgggttcgtgggcgatgaccaggtcttccgcgcttcgggcaaggaccgccaccgc 
ggggtggaantgacgact-tacggcgagctcacgcgcagcgtgcgcgtgctgggcgggctg 
acctggctggacgccaagcagctcagcaccggcaacgccgccaccgacggcaagcgcgtc 
atcggcgtgccccgcttccaggccaacctcggcgtggagtgggacatccccggGgtgcag 
ggcctgaccgtggacgggcgtgtggtctatacgggctcgtcctatgcggatgcggccaac 
accctcgaggtgccgggctggacgcgcctggacgccggcctgcgttacatgaccgatatc 
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ggcggccatctggtgacctggcgcgcccgcgtcgagaacatcgccaaccgcgactactgg 
tcctccgtgggcggctaccccggcaatggctacctggtgctgggcggcccgcgcaccttc 
acgctgtcggcatcgatggagttctga 



SEQ ID NO:8 polypeptide sequence of Orf4 

^ra3FHARKPVGGGHGRPAHGRPIiAWPRIPLRTTTMKPLPIJAYIAALLPW 

AAGDDASITLEAWVEASADASAGGLiAPAFAGGQVATGAKVGILGTRDNIjETPFSITAYT 

NEIilQDRQAKGVGDVLQNDPGVRVARGFGNFQESYFlRGFILSSDDIAYNGLYGIjIiPRQY 

ISTQIiPERVEVIiRGASAFLTGAPPSGGGIGGVINLVPKRAPNEPLTRFSAGYGSDSVLE^ 

SADIGRRFGPDDSVGIRINAAQRGGETAIDGERTRTTVFAIjGIjDWRGERARLSADIGYQD 

NRLKRARPmrriiAGDAAKVPGAPDAGSNYAQPWSYSNERDVFGTLRGEYDf^ 

ygi^qskeenslanlnnvngagqgkfyrfdnared 

asyfdlekkmyvmdffnqfdtsiydpvsyakpaisstaprgl©mddpakqgvirii^ 

LGDTMSFFDDKVLIiTAGIRHQRIiYQRDYSYDTGIGGTPYEQSHNSPAAGLVVRV^ 
YANYIEALSAGOTAPQTANGIiPVVimGBSIiAPYVSKQKEVGVKPEHDGIjG^ 
PRGFVGDDQWRASGKDRHRGVIBLTI^GELTRSWVIiGGLTWIJ^^ 
IGVPRPQANLGVEWDI PGVQGLTVDGRVVYTGSSYADAANTLEVPGWTRIjDAGIiRYMTDI 
GGHLVTWRARVENIANRDYWSSVGGYPGNGYIiVIiGGPRTPTLSASMEP 



SEQ ID NO:9 polynucleotide sequence of Orf5 

Gtgcctcgtcctacttcccgccgtacgcgccctgcgcgccggcaggcgcagcccgccttc 
Gtgcccgcgctcttcatgcttgccctgggcgccgttgccgccggcgcgcgcgcccagccc 
Gctgcggcgggtgtcccggatacgcaaggcgtggcgcaaatgccggccgtcacggtcaac 
Gccgcgccggtggacgacacgctggagcatctggaggcgccggtcgataccggcgcgctg 
Ggacggcgcacccagctggagacgcctttttccaccacggtggtgactgcccgcgacatg 
Gaggagcgccaggtcaacaagctgggagacgtgttcgcgctggatgcctcggtgacggac 
Aacagcgcgtcctatggcgcgtgggccagctacctgacggtgcgcggcctgccgctggat 
Tggcagaattcgtaccgcatcgatggccggccgttcctgagctacgtcacgacgctgccg 
Ttcgagcacttcgagcagatcgacctgctcaagggggcgtcgggcttcatgtacggtttc 
Ggctcgccgggcggcctggtcaactatgtcaccaagaagccgaccgacgaagcggtgcgc 
Agcgtcgagctgggctacgtgtccaaggggttgctgcgcgagcacgtggacctgggcggc 
Agggtgggccagagcggcgcgtttggctatcggctgaatgccacgcacgaggagggcaat 
Acc t acaacggt gga t cgc t gt ac cgcgat t cggt gt cgc tggcgc t ggat gcgcgc c t g 
Agcgaccggttgacct gggact t ccaat ccat ct accaggaccgcaaggccatcgggcag 
Gagcccacgatctatgcgggcaccatggccggcagcgagttgccatcgccggtgcgcaac 
Gacaatgacaggctggtcgggcagggaccgtatgcggacaatgcgttccgctattactcg 
Accggcttgaagtaccaattggcggacgaatggacgctgagcaccaattacagctacagc 
Tccacgcgtacccgccgcaacgagtcggtgctgttcctgcgcgaccaggcgggcgactat 
Gacgattaccgctcggactatggcgaggcctatggctacaaccagtggcaggccatgctg 
<3asr9gcaagttcgctaccggtcccttgaagcaccacgtggtggccggcgcgtcgtggcag 
Aagcagaagaacgactacagcgccaacggggtctatcaattgcagggcacgggcaacctg 
Cgcgcgcgcaataccaacacgtactacagcgaaggccagctgcacctgtaccgcgcggcc 
Gagatcacgcagaaggcgctgttcgccagcgacacggtcgacctgaccggcggctggtcg 
Gtgctgggcgggctgcgctatacgaattatgcgcagcaaggattcgatgccacgggcgcg 
Cgaacatcgcgctacgacaagaacggcgtgctgacgccgacctttgccctgatgtacaag 
Ctgacgccgcgcaccatggcctatgccagctacatcgaatccttggagccgggctcgtcg 
Gtgggcgccgcgtacgccaacttcggcgcattgctcgatccgttgaagagcaagcagtac 
Gagctgggcatcaagaccgaacaggacggctgggccgccacggcggcgctgtttcgcatc 
Gagaagaaggcggaatacgcgaatgccgccaacgagctggtgcaggacggcaagacgctc 
Tatcaggggttggaactgggcgcctccacgcgtatcgcccgcgactggaacgtgggaggc 
Agcctgatgttgctggattcggaatacaagaaaggctcggatttcaccggcaaccgcgtg 
Gcgggagcgccgaagttcgtggcggccgcgcaactggcgtactcggtgccgcaggtgccg 
Gggctgaagctgcgcgccgatgtgaagtacaccggcaacacgatgctgggcgccagcaac 
Cgggtgcaggtggacgactacgccatcgtcaatatcggcgccacctacgacacgcagatc 
Cacggctacgaggcgaccttcaccgccggcatcaacaacgtggccaacaagcgctactgg 
Ctgtaccagtcgtctgactacgtgaaggcgggcgacccgcggacctatggcctgacgtct 
atatccagatatccggatatctggatataa 
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SEQ ID NO:10 polypeptide sequence of OrfS 

VPRPTSRRTRPARRQAQPAFVPAIaFMLALGAVAAGAI^PiUAGVPDTQGVAQMPAVTVN 
AAPVDDTLEHLEAPVDTGALGRRTQLETPFSTTVVTAiyDMEERQVN^ 
NSASyGAWASYLTVRGLPLDWQNSYRIDGRPFLSYVTTLPFEHFEQIDLLKGASGF^^ 
GSPGGLVKrrVTKKPTDEAVRSVELGYVSKGLIiREHVDIiGGRVG^ 

TYNGGSLYRDSVSIiALDARIiSDRIjTWDFQSiyQDRKAIGQEPTIYAGTMAGSELPSPVR^ 
DNDRLVGQGPYADNAFRYYSTGLKYQLADEWTLSTNYSYSSTRTRRNESVLFL 
DDYRSDYGEAYGYNQWQAMLEGKFATGPLKHHWAGASWQKQKNDYSANGVyQLQGT^ 
RARNTNTYYSEGQLHLYRAAEITQKALFASDI^LTC^ 

RTSRYDKNGVLTPTFAIJiiyiaiTPRTMAYASYIBSLEPGSSVGAAYANFGALL^ 

EMIKTEQDGWAATATU^FRIEKKAEYANAANELVQDGKTLYQGLELGASTRIARDWN^ 

SIJMLLDSEYKKGSDFTGNRVAGAPKFVAAAQIiAYSVPQVPGIiKLRAD^^^ 

RVQVDDYAIVNIGATYDTQIHGYEATFTAGINNVANKRYWLYQSSDYVKAGDPRTYGIiTS 
ISRYPDIWI 



SEQ ID NO:ll polynucleotide sequence of Orf6 

Ttgcccgccataagcgtcacgggtcgtgagatttccgacctcaccgagggtacaaacgcc 
Tacacaaccgaggccatgagcacggccacgggcctgacactctcgccacgcgaaacacca 
Caatccgtcagtgtggtcacccgacagcagatcgaggatcagggcctcaccgacaccggc 
Gccatcctggcgaccgcgccagggatttccgtcacgcgcagcgacagcaaccgctattca 
Ttctcggcccgcggcttcaccatcgacaacttccagtttgacggcctggtatcgcccatc 
Ctgagccaatggaactatggttcgaccgatatggacgccgccatctacgatcacgtggaa 
Atcgtacgtggcgccacaggcctgatgacaggctcgggcaatccttcagccgccgtgaac 
Ttcgtgcgcaagcgtcccttgcgtgagttcgcggctacgttcaatgcgagtgtcggcagc 
Tgggactatgtgcgcggcgatgccgacatctccgtgcccatcacggaagacggcagaata 
Cggtcacgcttggtggccgcctacagtcagggcgacagctatgtgcactttttagatacg 
Cgccggcgcacattctatggcgtggtcagcgccgatctgacgccggatacggtgctgacg 
Accagcgtggagtaccagcacaaccacagcaatgggtttggcagcggctttccgctgttc 
Tatagcgacggttcgcgcaccgatttcaaccgctcggtggccaacaacgctccctgggcc 
Cggcaggataccgaagccaccacctatttcgtggacctcacgcaccgcttcaccaatgac 
Tggaagctgcgcgcggcctatagccacactgatggccgctatctcatgaaacatgtgtac 
Cggggcggctaccccgatcgccatactggcatcatcgctgccccccctgcattttccaac 
Tacgacggcaacctcgatcgggatgacatccatttttccttgtccgctcctttcgaggcc 
Ttcggcctgcgccacgaagttgccctgggctggatgagcatcgacaaccatagcgacatc 
Cagcgatacgcaatggtcggaccggccccagccatcggcagcttcttcgactggcgccgc 
Gcccacatccaagagcccagctgggccgacacgctgtcgcccgccgacgacgtgcgcacc 
Aagcagaccggcgcctatctggtcggccggtttgcactagccgaacccctgcacctcatc 
Gtg^gcgaccgttggagcgactggaaaaccaaacagatgtattttggctcgcgccgcgaa 
Tacaggatcaagaatcagttcaccccctatgccggtctgacctacgacatcaacgacacc 
Tacacggcgtacgccagctatacggagatcttccagccgcagaacgcgcgcgacaccagc 
Ggcggcattcttcctcccatcaaaagcaagagctatgagctgggtctgaaggcagcctat 
Ctggagggacggctcaatacctccgccgcgctctttcagacgcggcaggataacctggcc 
Caggtcatcccgggctcatccattccgggctttccgaacatgcaggcctcacgtgccgcc 
Tccggcgccaaggtcgaggggatagacctggaggccagcggccagatcctgcccgactgg 
Aacatcggcgccagctatacacacttcaccaccaaggacgccagcggcaaccccatcaac 
Accaatcatccgcgcagcctgttcaagctctacaccacgtaccgcctgccgggcgccctg 
Caccggcttaccgtgggcggcggcgttgactggcaaagtcgcatgtaccaggccgcagcc 
Agtccgcgcggcaatgtcgaagtcgaacaggacagctacgcactcgtgagcctcatggcg 
Cgcttcgactttaacaaaaaactgtcggcaacactgaacgtgaacaatctgttcgacaaa 
Aagtactacgatcagatcggcttctacagccagggttggtggggtgcgccacgcaatgta 
atgctcaacttgcgggcgcagtattga 



SEQ ID NO:12 polypeptide sequence of OrfS 

LPAISVTGRBISDLTEGTNAYTTEAMSTATGLTLSPRETPQSVSVVTRQQIEDQGLTOT 

AIIATAPGISVTRSDSNRYSFSARGPTIDNFQFDGIiVSPIIjSQWNYGSTDMDAAIYDHVE 

IVRGATGLMTGSGNPSAAVNFVRKRPIiRBFAATFNASVGSWDYVRGDADISVPITEDGRI 
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RSRLVAAYSQGDSYVHPIiDTRRRTPYGWSMriTPimniiTTSVEYQHNHSNGFGSGFPLF 
YSTCSRTDFmSVANNAPWARQDTEATTYPVDLTHRFTN^ 

RGGYPDRHTGIIi^PAFSNYDGNLDRDDIHFSLSAPPEAFGLRHBVAIjGWMSIDNHSDI 

QRYAMVGPAPAIGSFPDWRRAHIQEPSWAlOTjSPADDVRTKQTGAYriVGRPALAEPIiHLI 

VGDRWSDWKTKQMYFGSRREYRIKNQFTPYAGIiTYDINDTYTAYASYTEIFQPQNAROTS 

GGILPPIKSKSYEMIiKAAYLEGRLOTSAALFQTRQDNIiAQVIPGSSIPG^ 

SGAKVEGIDLEASGQILPDWNIGASYTHFTTKDASGNPINTNHPRSLFKLYTTO 

HRIiTVGGGVDWQSRl^QAAASPRGNVEVEQDSYALVSIiMARFDFNKKLSATI^^ 

KYYDQIGFYSQGWWGAPRNVMLNIiRAQY 



SEQ ID NO: 13 polynucleotide sequence of Orf7 

Atgaagttctactcfctcccatccgatgcccgagtcgctcgcggctgcgatcgcagtgcct 
Ctgttgggcctgctgccggcggcgcaggccgcgtccacggcggtccagctgccatccgtc 
Acggtcgagggcgagtactcgtcctatcaaccggaaagcgcccagtcgcccaagttcacc 
Gcgcccctggcggacacgccgcgcacggtgcaggtcatccctgagcggctcatccaggac 
Cagggggccagcgacctcgaagcggtactgcgcaatgcgccagggatatcgatgaccgcc 
Ggcgaaggcggccgtccggccagcgacctgccgttcatccgcggccagaattcggccagc 
Agcctttttgtcgacggcctgcgcgatcccagcacgcaatcgcgcgataccttcaacctg 
Gaacaggtcgacgtcgtcaaggggcccgattcggtattttccgggcgcggcggcgccggc 
Ggaagcatcaacctcgtcaccaagacgcccaggaaccaggatttcaccgaagtccaggcc 
Ggcatcgggacggccgagacctaccgaggcaccatagacggcaactgggtgctgggcgag 
Aacacggcgctgcgcctcaacctgctgggcaccagggacaccgtgccgggccgcgacaag 
Gcggtcgagttcagccgcgtgggtatcgcgccatcgctgcgcctgggcctgagcggcccc 
Acccgcgtgacgctgggcctgtaccactatcgccaccggcgggttcccgattattcgatt 
Ccgtacgatccgcgcaccggcacgccgatcaccgagaccatcggcgtcagccgccgcaac 
Ttctacggcctggtgcggcgcgactccggcgataccgaggactacgccgccaccgtcaaa 
Tgggagcacgacctcgccaatggcttcaaggtggagaacctggcgcgctactcgcgtgcc 
Acggtggagcagatcaccaccatgcccgaactgaaaaccgccgatctggccaaggggctg 
Gtgtaccgcaatctgcgcgccagctaccaggtcaacgacagtttcgccaaccgcaccgac 
Ctgcgcggtacattcgacacggggcagtggcgccataccttcgatctgggcggggagttc 
Gccaccagccggcgcagtcgcgaccgctacaagcaagaaatccccgacgccgccagtcct 
Tgctcgcccgtgacggacggcaacaatcccgccctgtgcgcctcgctccgggatccggat 
Ccgcacgtggatttcccgggaacggtgcggcgcaaccataacccggcccgctaccacacc 
Gacatcctgtccctgtacggtttcgacaccatcgccttcgacgagcagtggcagctgaat 
Ctcggcctgcgctgggaccactacaagaccagcggacgcaacctgccggtacgaggcgcc 
Aagccgcccgtctacgagcgtgccgcgcgcaccgacaacctgttcaactaccagctcggc 
Ctggtctacaagcctcgtccggacggctcggtgtatgcgagttacggcacggcgtccacg 
Ccgtcggccgtgtccgactacgccccggcggacagcatctccggcacaagccagcagctc 
Aagccggagcgcagcgaggcgatcgagatcgggaccaagtggcaggtgctggaccggcgg 
Ctgctggtgacgggcgccatgttccgcgagacgcgcaagaacaccagcatcgaagtcgcc 
Gaaggcctgcgcgcaccagccggcaagagccgcgtcaccggcatggagctgggcgtggcg 
Ggcagcctgacgccgcgctgggacg.tctacggcggctacgcgctgctcgacagcaagctg 
Gtcagggccagccataagagcggggcgcaaggccagccgctgcccagcgcgccccggcac 
Gcattcagcatctggagcacctacaagctgctgccggaactgaccgtgggggccggcgcg 
Ttctatcgcagcaaggtctatggcaacgcagatgccggctacaacaaggacggcacgccc 
Aaggcgcgctgggtgccggcgtactggcgcttcgacgccatggcggcgtaccagcttaac 
Aagcaccttacggcccagttgaacgtctacaacctgctcgacaagacctattacgccaag 
Acctaccgcagccattacgcggcgctgggcccggggcggtccgccatgctgacgttcaag 
ctgagctactga 



SEQ n> NO:14 polypeptide sequence of Ori7 

MKyYSSHPMPESIiAAAIAVPLLGLIiPAAQAASTAVQLPSVTVEGEYSSYQPESAQSPK^ 
APLADTPRTVQVIPERLIQDQGASDLEAVLRNAPGISMTAGEGGRPASDIaPPIRGQNSAS 
SLFVDGLRDPSTQSRDTPI^EQVDVVKGPDSVFSGRGGAGGSINIjVTKTPRNQDFTEVQA 
GIGTAETYRGTIDG2SrWVIiGENTALRI*NIiIiGTRIH^ 

TRVTIjGIiYHYRHRRVPDYSIPYDPRTGTPITETIGVSRRNFYGLVRRDSGDTEDYAATVK 
WEHDIANGFKVBNIiARYSRATVBQII^PEL K T ADIjAKGL ^ 
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IiRGTFiyiXSQWRHTPDLGGBFATSRRSRDRYKQEIPDAASPCSPVTDGNNP 
PHVDFPGTVRRiraOTARYHTDIIjSLYGPiyriAPDEQWQLNLGI^^ 

KPPVYERAARTD^^^FNYQIJGIJVYKPRPDGSVYASYGTASTPSAVSDYAPADSISGTSQQL 
KPERSEAI EIGTKWQVLDRRIiLVTGAMFRETRKNTS I EVAEGLRAPAGKSRVTGMELGVA 
GSLTPRWDVYGGYALIiDSKLVRASHKSGAQGQPLPSAPRHAFSIWSTYKLLPELW 
FYRSKNTfGNADAGYNKDGTPKARWVPAYWRFDAMAAYQLNKHLTAQIj^^ 
TYRSHYAALGPGRSAMIiTFKIiSY 

SEQ ID NO:15 polynucleotide sequence of OrfS 

Atggagaagccgttgaaatccctggactcgtattcagcgagcacgctcgccaactcgctg 
Gccgccgccattgcggtgccggccctgtgcctgatgcccggtgctcaggcacagaccagc 
Gcgggcgttacccaattggcgccggtgcaggtagagggcgaagcgtccccctatcaggcc 
Accaccgtccagtcgtccaagatgacggcgcccttgctggatacgcccaggaccgtgcag 
Gtcgtgccgcagcaggtcatccaggaccaggccgccaccaatctgcaggacgtgctgcgc 
Aactcgccgggcatcaccatgggcgccggcgagggcgggcgcgccggcggcgacctgccc 
Atcatccggggccagaatgcggcgggcagcatcttcgtcgacggcgtgcgcgaccccagc 
Acccagatacgcgatacgttcaacctggagcaggtcgagatcatcaaggggcctgattcg 
Gtctactccggccgcggcggagccggcggcagcatcaacctggtcagcaagacgccgaag 
Gcgcgcgacttcgccgagggctcggtgcagatcggcaccgacagcaattaccgcgccacc 
Gccgacggcaactggctgctgggcgacaacgccgccttccgcctgaacctgatgggcaac 
Aagggcgacgtgccgggccgcgaccatgcggtcgatttcagccgctggggcgtggcgccc 
Accctgcaactgggcgtgggcacgcccacccgcatcaccctggggtactaccactaccag 
Gatgacagcatgcccgattacgcgatcccgtacgatccgaagtcggggcagccggtcacc 
Gagacccagggcgtcagccgcaagaatttctacgggctgaccggccgcgacttcatgaag 
Tcgcgcgacgacgtggccacgctggccatcgatcacgatttcagcagcaagctgcgcctg 
Cgcaacgtcacccgctacgggcgctcggtgaccgactacgccgccaccaatccggatgac 
Agcaagggcaacgtgcccaacgggctggtgtaccgggcgctgaaggcgggctactacacc 
Aacaagacgttcaccaaccagaccgacctgagcggcgaattcgagacgggcagcctgcag 
Cactcgttcgacgtgggcttcgagtacagcaacatcaagcaggacaaggactcgtatacc 
Cagactatcgccaagggcgcgatgccttgcaaggtgggcgccaacgatgccagcaatccg 
Gccttgtgcacctcgctgtgggatccggatccgcatgactattatcccggccacctgtcg 
Cgcaacgacaacccggcccgctattccaccgacacgatcgcgctctacggcttcgacacg 
Atcaagttcaacgagcaatggcaggccagcgtcggactgcgttgggacaattaccgcgta 
Agcggcagcaatatcgcccgcggccgcaacgatcccgccagcacgccggcgttctacagc 
Accagccgcgaagacaatctgttcaattaccagctgggcctggcctacaagccggtgccc 
Aacggcacgatctacgcctcgtatggcacctcgtcgacgccgtcggccgtcgccggctcg 
Aa cgt gagcga cgccg t ga cgc^t gagcaacgagt cgc t ggcgccggagaaaagccgcacc 
Gtcgaggtcggcaccaagtggcaattgttcgacgaccgcctgaccctgtcgggcgcgttg 
Ttccaggacatccgcaagaacaccagcgtggccgtgtcggcgaccgaaacggagcagatc 
Ggcaaggccaaggtgcgcggcatcgaactgggcttctcgggcagcatcacgcccaagtgg 
Aacgtctacggcggctataccttcatggacagcgaactggtcgagggcgcctacaacagc 
Ggcgcggtgggccaggacctgcccaacacgccgcgcaatgccttcagcctgtggaccacc 
Tacaagctggtgcctcagctgaccgtgggcggcggcgcctattacgtggacaaggtatat 
Ggcaacgcggacaacggtcgcaatgccgacggcacgccgaaggcgcgctgggtaccgtcg 
Tactggcgcttcgacgccatggccgcgtacgagttcaacgaccacctgaccgcgcagctc 
Aacgtgatgaacatcttcgacaagacgtactacaccaaggcctacgcggcgcactacgcg 
gcgctgggcacgggccgcgccgcggtgctgtcgttcaatatcaagtattga 



SEQ ID NO:16 polypeptide sequence of OrfS 

MEKPLKSrjDSYSASTLANSIiAAAIAVPALCLMPGAQAQTSAGVTQLAPVQVEGE;^ 
TTVQSSKI^APIiliiyrPRTVQVVPQQVIQDQAATK^ 

I IRGQNAAGS I FVDGVRDPSTQI RDTFNLEQVE 1 1 KGPDS VYSGRGGAGGS INLVSKTPK 

ARDPAEGSVQIGTDSNYRATADGNWLLGDNAAFRIjNIjMGNKGDVPGRDH^ 

TliQLGVGTPTRITLGYYHYQDDSMPDYAIPYDPKSGQPVTETQGVSRKNFYGLTGRDFM^ 

SRDDVATIiAIDHDFSSKliRLRNVTRYGRSVTDYAA™PDDSKGWPNGLVyR^ 

NKTFTNQTDLSGEFETGSIiQHSFDVGFEYSNIKQDKDSYTQTIAKGAMPaCVGAN^ 

ALCTSLWDPDPHDYYPGHIiSRNDNPARYSTDTIALYGPDTIKPNEQWQASVGrjiWDN^ 

SGSNIARGRNDPASTPAPYSTSREDNLPNYQLGLAYKPVPNGTIYASYGTSSTPSAVAGS 
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NVSDAVTVSNESIiAPEKSRTVEVGTKWQLPDDRLTLSGAIiTODIRKOTSVAVSATETE^ 
GKAKVRGIEIjGFSGSITPKWNVYGGYTFMDSELVEGAYNSGAVGQDLPNTPRN^ 
YKLVPQIiTVGGGAYYVDKVryGNADNGimADGTPKARWVPSYWRFDAMAAYEFN^ 
NVMNIPDKTyYTKAYAAHYAALGTGRAAVLSFNIKY 

SEQ ID NO:17 polynucleotide sequence of Orf9 

Ttgcatactcgcacgccacagcggcagcgcccggtcgcgccgcgcctgctgcatctatcg 
Ctggccgcctcgctggcggccggcgccgcccaggcgcagaccgccaccgaagccaccacc 
Ctgcccaccgtgcaggtcaccggcaggggcgagaccgccaccggcccggtcgacggctat 
gccgccacgcgcagcgccaccgccaccaagaccgataccccgctgtcggaaacGccgcag 
gccgtcacggtgatcccgcgcgagcagatcatcgaccagggcgcgcagaacgtgcaggac 
accatgaactacgccgcgggggtgcgccccaacgcctatggcgtggacaaccgcggcgac 
tacgtgcgggtgcgcggggtggagccggtccagtatctcgacggcctgaagcagttcttc 
aactacaacaatccgcgcaccgaggtctacgggctcgagcgcgtcgaagtcctgcgcggc 
ccggcctcgatgctgtacggccagggcagcaccggcggcgtggtcaacctcgtcagcaag 
cggccgcagcccgaggccatgcgcgagatcggcgtgaccgtgggcaaccacaaccgcaag 
gagatccaggccgatctcaccggcccgctgaccgaggacggcacctggctgtaccaggtg 
gtggccctcggacgcgacagcgacacgcaggtccagtacaccaaggacgaccgcatgatg 
ctcgcgccgtcgctgacctggcagcccagcgccgccacctcgctgacgctgcaggcctac 
tggcagaaggacaagtcgggcacgacccaggccttcctgccctggagcggcacggtcagc 
ggcaaccccaacggccgcatccccacccgccgcttcaccagcgaacccggcttcgaccgc 
tacgacaccgagcaattcagcgtgggctggcagttcgagcacaagttcaacgacaactgg 
aaagtgcgccagaacctgcgccacacatccagcaaggtcgactacagcacgctgtatccg 
gcggtctacggcaaccccgacaatcccttcatcgacgccgaccagcgcgtggtcaatcgc 
tacctgtacatcaagaacccgcgcatgcgttccttgctggccgaccagaacctcgaaggc 
aaggtgaactggggccgcgccgaacataccctgctgatgggcgtggactacagccgctat 
cgcgagaccggcgagaccggcagcgggttcggcgcgccgctggacctgtaccagccggtc 
tacggcaccctgcccgactatgccatgtcggacgtgcccaagaacaagcagcagcagatc 
ggcgtctacctgcaggaccagatcaagttcgaccgcaactggatcgtggtggcgggcctg 
cgccacgaccgcgtcgccaacagcgtcgagggcgccgacaaggaaaccgacaacgccacc 
accaagcggctgggcctgatgtacgccgccgacaacggctggtcgccctacctcagctac 
agcgagtcgttcacccccatcgccggcaccgacaacagcggcaaccgctgggtgccgatg 
cgcSSrcsagcaatgggaagcaggcctgaagtacatgccgcaggacaccggctatgaggcc 
accctggcggcctacgacctgcgcgagcgcaaccgccagaccaacgacccgtccgatccc 
accaaccaggtgcagaccggcaagaccaagacgcgcggcatcgaactggaattccgcggc 
cgcgtcacgccgcagatggatgtgatcgccaactacaactacaccgacatcgacccgcag 
ctcgaaggcctgcccaagcacacgttctcgctgtggagcaaataccggttcagcgtgggc 
gatgtgcatggctttgccgccggcgccggcgtgcgctacctgaacgcgtttcgcgacggg 
tccgcgcccgagaccggctcggtggccctgttcgacgccatgctcagctacgacaccggt 
tcgtggcgctatgcgctgaacgtcgccaacatcgccgacaagacctacgaggtggtgtgc 
ctgcggcgcggcgattgcttctacggccagcgccgcacggtcaccctgagcgccatgtac 
cgcttctag 

SEQ ID NO:18 polypeptide sequence of Oii9 

LHTRTPQRQRPVAPRLLHIiSIiAASIiAAGAAQAQTATEATTLPWQV^ 

AATRSATATKTDTPLSETPQAVTVI PREQI IDQGAQNVQDTMNYAAGVRPNAYGVDNRGD 

YVRVRGVEPVQYLDGLKQFPNYNNPRTEVYGLERVEVLRGPASMLYGQGSTGGVVNIjVSK 

RPQPEAMREIGVTVGNHNRKEIQADLTGPLTEDGTWLYQVVALGRDSDTQVQYTKDDRiy^ 

LAPSLTWQPSAATStiTLQAYWQKDKSGTTQAFLPWSGTVSGOTNGRIPTRRFTSEPGFDR 

YDTEQFSVGWQFEHKFNDNWKVRQNIJlHTSSKVDYSTIjYPAVyGNPDNPPIDADQRVVNR 

YLYIKNPRMRSLIiATONLEGKVNWGRAEHTLLMGVDYSRYRETGETGSGFGAPLDL^ 

YGTLPDYAMSDVPKNKQQQIGVYIiQDQIKFDRNWIWAGIiRHDRVANSVEGAD 

TKMjGIiI^AADNGWSPYLSYSESFTPIAGTDNSGimWVPMRGKQWEAGIjK^ 

TLAAYDLRERNRQTimPSDPTNQVQTGKTKTRGlELEFRGRWPQMDVIANYl^^ 

LEGLPKHTFSLWSKYRFSVGDVHGFAAGAGVRYIiNAFRDGSAPETGSVALFDAMLSYDTG 

SWRYAr.NVANIADICrYEWCriRRGDCFYGQRRTVTLSAMYRP 

SEQ ID NO:19 polynucleotide sequence of OrflO 
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Atggtttacgcttgcgatcagggcgcccgccgcgcccgtcccgcgccgccaaggcgcccc 
Caaacggcactggccatgcgcggcgcgctggcggcatgcgcactggccggtacgctggcg 
Gccgctcccgccgccgcgcagccgacggcggcgcccgcatcggcgggcgcgcgcgcctgg 
cacatcgacgccggccccctgggcgaggccctggcgcgctttgccgaccaggccggcatt 
accctgctgtacgaccccgccgcggtgcgcggccgcgccagcgccggcctgcaaggcgtg 
tactcggtgcccgacggcctggcgcgcctgctcgatggcagcggcctggacgcgcgccag 
cgcggcgccggcacctacgtgctgcaggcgctgcccgccggcccggtcgcccagctggcg 
ccggtcaccatcgaggctgacggcgtgcgcgccgatcccgcctgggcccgcaccgccacg 
cgccgcgagctcgacgcgcgccaggtgctcgactggagcgatatcggcaagcgcgtcgat 
cccggcgtcaactacaaccgccgcaccaagagcatcaacatccgcggcctggacgaaaac 
cgcgtggtcacgcgcatcgacggcatccgcctgccctggctcgacgacggcgcgcgcggc 
atccagggcgggctgaacgcggtggacttcaacaccctgtcgcgcctggacgtcgtgcgc 
ggcgccgactccagcgcggccggctccggcgcgctgggcggcctggccgacctgcgcacg 
ctcgaacccgccgacctgctgcgcgacgggcgccgcttcggcgcgctggccaagtccgac 
tatgactcggccgacgccagctggggcctgaacgcggccctggccgggcaggtccacgac 
gacaccagctggctgttgcaggcgggcacccgcaatgggcacgacctggacaaccgcgcc 
gacacgggcggctacggcagcaagcgcagccagcccagccccgaggactacgcccagaac 
aacttcctgctcaagctgcagcagcgcatcgacggcggccatcgcctcggcctgacgggc 
gaatacttcaagcgccgcgccgacctcgaccagatgtaccagcagggcgccggcaccagc 
taccagtacggcgccaaccgcacccacgaggaaaccacgcgcaagcgcgtctcgctggac 
taccagtacaacgccccgcaggccggcgccgcgatcgacagcgcccgggccatggtgtat 
tggcagcggctgcggctggacagctcgcaggacgcccgccgcacgcgcgacgggcgcgcc 
tacgcccgccccggcgacccgtacttctacggctaccccagcggcccctacgggcgcagc 
aactcgatccaggaatcgatcctcggcgtcaacggcgagctctccagccgcttcgaaggc 
atggtgtcgcagcgcgtgacgataggcggcgaatggtacggcaaccgcaccgagcagtac 
tcggacggctacgacaactgccccgccatcccgcccggcacgcccgcgccgatggggccg 
cgcctgtgcgacatgctgcataccaaccaggccgacatgccccgggtcaagggcagccag 
tgggccatctgggcgcaggacgaaatcgccttcgccgacgggcgctacatcctgaccccg 
tcactgcgctacgaccattacgagcagaagccgcagcaaggcggcggctaccagaacaac 
cccaacgccggcgcgctgccgccgtcgtcgtcggggggccgcttctcgcccaagctgctg 
ggcacctggaaggcgcgcgaggcgctgacgctgtatgcgcaatacggcttcggctaccgg 
gcgccgtcggccaccgagctgtacaccaactacggcggcccgggaacctatctgcgcgtg 
ggcaatccctccttgaagcccgagaccagcaagggctgggaactgggcgcccgcctgggc 
gacgaccagttgggaggcgccgtatcgctgttcgacaaccgctaccagaacttcatcgac 
aagaacgtgccgctgggcaagggttcgccgcaatggcagccggcctgggacggccagtac 
ccgctgggcgtcaccgggctggccaaccgggcgcgcgtgcgcatctatggcgccgaagcc 
tcggcgcactggcggttcgcgcccaactggcgcacctggggctcgctggcctgggccgtg 
ggcaaggacgaaaacaccggccagcacctgaattcggtgccgccgctcaaggccatcctc 
ggcctgggctaccagcgcgacgaatggggcatcgacgccatgctgacggccgccacgcgc 
cgcgacgacgtgcaataccccgaggcctccgccagcgcgcgctacgccgatttccaggcc 
ccgggctacggcgtggtggatctgtccgcctactggcgcccggccgccgtcaagggcctg 
cagctgcaggcgggcgtgttcaacctgttcgacaagaaatactgggaagccatcaacgtg 
cccacggcgggtgccattgcgattccgcgaccgttagactggtacaacgagccaggccgc 
agcg t gcgcgt at cgt t gacc t acc agt at tga 

SEQ ID NO:20 polypeptide sequence of OiflO 

MVYACDQGAIUy^PAPPRRPQTALAmGALAACMiAGTI^^ 

HIDAGPLGEALARPADQAGITLLYDPAAWGRASAGLQGVYSVPDGLARLLDGSGIjD^ 
KtJAGTY VLQALPAGPVAQIiAPVTIEADGVRADPAWARTATRREIjDARQVLDWSDIGKRVD 
HSSamiEETKSINIRGIjDENRVWRIDGIRLPWIJDI^ 

GADSSAAGSGAIiGGLADLRTLEPADIJJRDGRRFGAIiAKSDYDSADASWGIjNAAI^ 

OTSWIjIjQAGTRNGHDLDNRADTGGYGSKRSQPSPEDYAQNNFIiLKI^ 

EYFKllRADLDQiyr^QQGAGTSYQYGANRTHEETTRKRVSLDYQYNAPQAGAAIDSARAMV^ 

WQRIiRLDSSQDARRTRDGRAYARPGDPYFYGYPSGPYGRSNSIQESIIiGVNGELSSRFEG 

MVSQRVTIGGEWYGNRTEQYSIX3YDNCPAIPPGTPAPMGPRliCDMIiHTNQADMPR^^ 

WAimQDEIAFADGRYIIiTPSLRYDHYEQKPQQGGGYQNNPNAGAIjPPSSSGGRFSPKLL 

GTWKAREALTLYAQYGFGYRAPSATELYTNYGGPGTYLRVCMPSLKPETSKGWBLGARM 

DDQLGGAVSLFDNRYQNFIDKNVPLGKGSPQWQPAWDGQYPIXSVTGI^^ 

SAHWRFAPNWRTWGSIJAWAVGKDENTGQHIlNSVPPLKAILGIJGYQRDEWGIDA^I^ 
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RDDVQYPEASASARYADFQAPGYGWDLSAYWI^i^VKGLQLQAGVFNIiFDKI^^ 
PTAGAl AI PRPLDWYNBPGRSVRVSLTYQY 

SEQ ID NO:21 polynucleotide sequence of Orfll 

Ttgcggccgggccggcggcgcgccgcgcgctgcccgtcatcgacatgcccgccggctgcc 
Ggcgcggcgccacaacgtgtgatcatgaaacagacttccctttactacgccaccctgggc 
Ctggtcggactggcgctggccgcgcccgcgcgcgcgcaggagcaatcgcttcccgtccaa 
ctcgcgccggtggtcgtgcatggcgcgcccgaggccaacggcccgctgaatctcgacgcg 
gtcgacagcaccggcagccgcctgggcctgaccctgcgcgagacgcccgcctcggtgacc 
gtcatcaaccgcgagcagatcgaggcgcgcggcgcgctcgacacgcaggaaatcgcccgc 
ggcatcgtcggcgtggacaatgcctcgccgcccggctcggccggctcggtgagctaccgc 
ggt 1 1 c t cggg 1 1 cgc aggt cagccagt t gt t caacggca 1 1 1 egg t gcagt acgacgt g 
gtcgccgcgcgtccgatcgacagctggatctacgaccgcgtcgaagccatcggcgggccg 
tccagcttcctgttcggcgcgggcgcggtgggcggcgccatcaactacgtgaccaaggtg 
gcgcagcgcga t acgttctacga cggccagc t gcgcc t gggt t cgt acggcgcgcgc cag 
gcatccgtgggccttaaccggcaattggccggcgagccgggcgggcgcggccagtacctg 
cgcatcgacgccaacgccaacgcgagcgacggctgggtcgacggcaatcgctcgcacgcc 
gagcaggtggcggcctcgctgctgtcggacctgggcgaacgcgtgacccatacgctggcg 
ctggagtaccagcacgagatggtgcaccggccttactggggtacgccgctgaccaccgac 
ggcgacggcgtggtgcgcggcgaaggccacatccgcggcgggacgcgctggaagaactac 
aacgtcgacgacggccggtacgagcaatcggtgtggtggctgcgttcgctgaccgaatgg 
caggccagcgaccgcctgagtttccgcaatacgctgtactactatcgcgccgatcgcgat 
ttccagaacctcgagacctaccgctacaacccgggcaacagccaggtgctgcgctccggc 
gcgctgctgcagcgccacgagcagcgcctgctgggcaaccgcatcgaaggcctgtaccac 
ggcagcctgggcggcctgcgcagcgactggtcgttcggcgccgactacagcgtcaaccgc 
cagacgcgctaccccaccagcgtggccgggcaagtcgatagcgtggacccgtacgagttc 
gacccgggcgagttctacgacattccgggcatgcggcgcggccacgtgcccgaccgcgac 
aacaaggtgcgcacgctggccttcatgctggaaaaccgcaccgaagtgggcggcggggtc 
gcgctggtgacggctctgcggcacgacatcatcgacctggacctgaccaaccggcgcgcg 
gccagcgcggcttcgcccgggcacgcctcgcgccgctacaacccgaccacggggcgcgtc 
gccgtcaattgggaggtcagtcccggcgcgaccctgtacgcgcaatacgccaccgccgcc 
gacccgccttccggcgtactgt:cgaccgcgaccttcgccgatgtgctgaacaacgacaag 
ctgaccaccggcacccaggtcgaggccggcggcaagttcgcgttctgggacggccgcggc 
acggcgaccgtggcggtctacgagatcaagcgcaagaacctcgccacgcccgatcccctc 
aaccccggcagcagcctgccggtgggcagccagtctgcccgcgggctggagctggccggc 
ggat t gcagt tgacgcgcgccttgtcgctgcaggccaacctggcgctggtcgacccccgc 
tatgacgatttctcgcagaacgtcggcggggtggcggtctcgcgcaacggcaaggtgccg 
gtcaacacgccgcgccggctggccaacgtgtggctggactacgccttcgcgcccgactgg 
cgcgccagcctggcggcgcgccacgtgggcaagacctatgcggacgcggccaatacggtg 
tgggcgccggcctataccgtgttcgacgcggcgctgtcgcatcgcatcgaccgccatttc 
agcgtgacggcgcgggtgcgcaacctgaccgacaaggtctatgccgccagcgtgaccggc 
gcgcccatgtattacctgggcgcgccgcgcagcgtcgaactcgcgctgcaggcgcgcttc 
tga 

SEQ ID NO:22 polypeptide sequence of Orfll 

LRPGRRRAARCPSSTCPPAAGAAPQRVIMKQTSLYYATLGLVGLALAAPARAQEQSIjPVQ 
lAPVVATHGAPEaNGPIiOTjDAVDSTGSRLGIiTLRETPASVW 

givgvdnasppgsagsvsyrgfsgsqvsqlpngisvqydwaarpidswiydrveaiggp 
ssflfgagavggainyvtkvaqrdtfydgqlrlgsygarqasvglnrqlagepggr^ 

RIDANANASDGP^VTKSNRSHAEQVAASIiLSDIjGERVTHTI^ 

GDGVWGEGHIRGGTRWKimJVDDGRYEQSVWmjRSnTEWQASDRLSPRm'LYYYI^ 
FQOTjETYRYNPGNSQVIiRSGAIiliQRHEQRLLGNRrEGIiYHGSIiGGIjR^ 
QTRYPTSVAGQVDSVDPYEFDPGEFYDIPGMRRGHVPDRDNKVRTrlAF^ttlENRTE^ 
ALVTALRHDI IDLDLTNRRAASAASPGHASRRYNPTTGRVAVNWEVSPGATLYAQYAT^ 
DPPSGVLSTATFADVIilJOTKIiTTGTQVEAGGKFAFWDGRGTAT^ 

NPGSSLPVGSQSARGLEIiAGGLQLTRALSIiQAlJUiALVDPRYDDFSQWGGVAVSRNGKVP 

VOTPRRLANVWLDYAFAPDWRASIiAARHVGKTYADAANTV^ 

SWARVRNLTDKVnfAASVTGAPMYYLGAPRSVEIiAIjQARF 
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SEQ ID NO:23 polynucleotide sequence of Orfl2 

Atgaaatcccgctcactccggcgctgcgccggtgtcctggcctgtgtcgctccgttggcc 
Ggccacgcccaggccggcgccgccgccggccaacccatccccgaactcgatccggtcgtc 
Gtcaccgccgcgcgatcgccccagctgctcaagaatgtgctggccgacgccagcgtgatc 
gagcgcgatacgctggcgcgcgccggccagtccagcctggccgaagtgctggcgcagcag 
cacggcatcgaattcgccgacagcggcggcccgcaaagcgtcaccagcctgttcatgcgc 
ggcgccaacagcaaccagaccctggtcctgctcaacggccagcgcatcaacaacgccaac 
SScggcggcattgcgctcaacgcgctgccgccggaagccatcgaacgcatcgagatcatg 
cgtggcgcggccagcagcctgtacggggccgacgcgatcggcggcgtgatcaacatcatt 
acccgcgagccgggcgacaaggcgctgtcggcctatgccaacgccggttacggcacctac 
ggcaccagccgctacgacgccggcgtctcgggcgcggccgacggcttcagctacagcctg 
tccaccggctatggccagagccatggcttcaacgccaccaaccgccgctcgttctcgtac 
aacccggacaaggacagctactaccagaactacgccaccggcacgctgggctacgaatgg 
cggcccgagcagaaagtggtggcgcaggtctaccgcagccgcatcaacggcggctacgac 
gcctcggcctcgtacgactacaacgaccgctacatccaggacctgcaggcctattcgctg 
gccagcgaaaaccgcctgacccgctactggaagagcacgctgcgcgccggctatgtggaa 
gacaagaacgattcgcgcgccgaaggcatgttcgaagacaacaacacgcgcttccggacc 
cgccagatgcagtacctgtggcagaacgacttcaccctggccgccggccagacgctgacg 
ctggcctacgagcacctggaccagcgcgccgacggccagatgagcaccgccaccggcatc 
ggcaactacaccgagacgcgccgccacgtgaactcgtacaccggcgtctacctgggcgat 
ttcggccgccaccatgtgcaggccagcctgcgcaacgacaacaactcgcagttcggcagc 
cacaccaccggcggcctggcctacgggttcgacctgacgcccaacctgcgcgccaccgtg 
gccgccaacacgggctttcgggcgccgtcgttcaacgatctgtacacgccgaccagcgcg 
ttcggctatcgcggcaaccccgacctcaagccggaagagtcgcgcaacgccgagatcggc 
ctgaaataccaggacgaggacagcgaactgggcgtggtgtattaccagacccgcatcaag 
aacctgatccaggtgaccgaggacttcagcacggtcgagaacgtcgggcgcgcccgcctg 
caaggcttcaccatcagcggcgcgcaccgcttcggcaacacgcgcctgcgcgccagcctg 
gacctgagcaacccgcgcaacgaagacaccggcaagcaattgctgcgccgcgcccgcacg 
gtgctgcgcgccggcatcgaccatcgcttcgaccgcctgctggtgggcgccgagtggtac 
gcctcggacgagcgctacgactacggcttccccgaggaaaagcgcctgggcggctacggc 
ctggtcaacctgaccgcggcctacgacctgagccgcaacatgcaggtgcaggtgcgctgg 
aacaacgtgctcggccagcgctacaccttggccgacggctacaacacggccggctcgaac 
gcct t cgt caacccgt cgt ggcgcat gt ag 

SEQ ID NO:24 polypeptide sequence of Orfl2 

MKSRSLRRCAGVLACVAPIiAGHAQAGAAAGQPIPELDPVVVTAARSPQLLKI^^ 

ERDTLARAGQSSLAET^niiAQQHGIEFADSGGPQSVTSLFMRGANSNQTLVL 

GGGIALNALPPEAIERIEIMRGAASSIiYGADAIGGVINIITREPGDKALSAYAN^ 

GTSRYDAGVSGAADGFSYSLSTGYGQSHGFNATNRRSFSYNPDKDSYYQNYATGTLGYEW 

RPEQKVVAQVYRSRINGGYDASASYDYNDRYIQDLQAYSLASENRIjTRYWKSTLRAGYVE 

DKNDSRAEGMFEDNISrrRFRTRQMQYLWQNDPTLAAGQTLTLAYEHIjD 

GKrrTETRRHVNSYTGVYLGDFGRHHVQASIiRNDNNSQFGSH^ 

AANTGFRAPSFOT)i:iYTPTSAPGYRGNPDLK:PEESRNAEIGLKyQDEDSEIiGVVYYQTO 
NLIQVTEDFSTVENVGRARIjQGFTISGAHRPGNTRIjRASIiDLSNPRNBDTGKQIiLR^ 
VLRAGIDHRFDRIjIiVGAEWYASDERYDYGFPEBKRLGGYGLVNLTAAYDLSRNMQVQVRW 
NNVIiGQRYTIiADGYNTAGSNAFVNPSWRM 

SEQ ID NO:25 polynucleotide sequence of Orfl3 

Atgattccaccttgccgcttatccctgatcccggcgctggccgccatggcgctggcaggc 
Gcctttcccgcgccgagcggggccgcgccggctgaattggcgcccatcgcggtcatcggc 
Gacgatcccgacgatccgcgggtattcgaaggcagcaccgccacccgtaccgccacaccg 
ctgcgggaggtgccgcagacggtcgacaccgtgaaggtgccggacgccctgaactatggc 
gcgcgcacgctgggcgaggcgctggccggcgtgcccaatgtcaccgacgccagcgatacc 
cgcttcgacggcttgcgcatacgcgggttcgacgccggcagcgacttctacctggacggg 
gtgcgcgatgacagccagtacgtgcgcgacctgcacaacatcgagcgcatcgaggtgctc 
aaggggccggccggcgttctgtacggccgcggcagccagggcggcatcgtcaatcgggtg 



123 



wo 2005/032584 



PCT/EP2004/011082 



agcaaggcgcccgggccgggccgcgcttccaccctcgaagtccggctgggcggcgaggac 
tttcgcagcctgtacgccgacctgagcgcggacccttccgacacggtcagcctgcgcctg 
aacgtgggcggcgagaatgcgggcagtttcaggcacggggtcagctcgcgccgccgcctg 
gcgtcgcccgccttggcgtggcgcattacgccacggctcgattggctggcgcagtacgaa 
cacagccgctacgaccgcgtgcccgaccgcggcattccctcggtggacggccggcccgcg 
ccggtcgggcgctcgaccgtctacggcgaccccgggcgcgacaatatcgacgatcgggtc 
caggtgctgcgctcgcgcctgcgctaccgggcggccaatggatgggagctgcgccatacc 
ctgtcgacgttccggctgcatagcgatttcgacaacacctatctgtccggctggcgcgcc 
gagaccgggctggtgcaacgccagcgctggcagcagcacctgcgcgcccggcatctttac 
aacgtcttcgaggccgagggcacgttcgccaccggctggctcgaacaccgcttgctggcc 
ggcgtcgagctgggcagccagcatcgcgatccgacgctgcaccgcgcggccaccaaaggc 
cccggcgcgcagccggtgcccgggctggcgctgcaccaccccgacttgagccagcagcac 
cacggccgcatggagcgcgccagcgatgcgcgtcaccgcgtgcgtacgcaaggctactac 
ttgcaggatcaactgcgattgagcgagtcctggcaggtggtggcgggcgcgcgcctggac 
cggttcggggtgcgcacgcgcaatcgcctgctgggcctggaaggcagccgtggcgaccgc 
agtgtgagtccgcgcctgggagtggtctggacgccctggccggcgcacgcgttctacgcg 
tcgtacagcaagactttctcgcccaccggcggcggcaccataggcatcacgccggacgcg 
cggggcaacgccaatgatctgccgcccgaacatacgcgccagtacgaagccggggtcaag 
agcgactggctggacgggcgcctgagcaccatgctggccgtctaccagctcgaactctac 
aaccgccgcacgcgcgcgccccacgatcccacgcggatactcctgacgggcctgcagcgc 
tcgcgcggcctggaaatgagcggggcggggcggctagctgtgaagattcaatag 

SEQ ID NO:26 polypeptide sequence of Orfl3 

MIPPCRLSLIPAIjAAMAIjAGAFPAPSGAAPAELAPIAVTGDDPDDPRVPEGSTATRTATP 
IJIlEVPQTVDTVKVPDAIi^^fGARTIJGEAIJAGVPlmT)A^ 

VRDDSQYVraiiHNIERIEVIiKGPAGVLYGRGSQGGIVNRVSKAPGPGRASTLE^ 

FRSXiYADIiSADPSDWSIiRIjNVGGENAGSFRHGVSSRRRLASPAIiAWRITPRI^ 

HSRYDRVPDRGIPSVDGRPAPVGRSTVYGDPGRDNIDDRVQVIjRSRLRyRAANGWEIjRHT 

LSTPRLHSDFDOTTLSGWRAETGIjVQRQRWQQHLRARHIiYNVFEAEGTPATGWLEHRI^ 

GVELGSQHRDPTLHRAATKGPGAQPWGIiALHHPDIaSQQHHGRMERASDARHRVRTQGYY 

LQDQIiRLSESWQWAGARLDRPGVRTRNRIiLGIiEGSRGDRSVSPRIiGV^^ 

SYSKTFSPTGGGTIGITPDARGNANDIjPPEHTRQYEAGVKSDWIjDGRLSTMIjAVyQL 

NRRTRAPHDPTRIDLTGLQRSRGLEMSGAGRLAVKIQ 

SEQ ID NO:27 polynucleotide sequence of Orfl4 

Atgaacacgctgcgacgcctgcgcatcctgggcgccgccgccacgctgggcgggccggcc 
Gccgcgcaggaggcgcccgccatgctggagccggtgcgcatcagcggcacgcgcaccggc 
Acctcggtgctcgatacgcccgcgtccgtggacgtggtcgatggccacgagctgcgcgcg 
Cgcaacctgcaggtcaacctgtccgaaggcttggccggcgtgcccggactgcagctgcag 
Aaccgccagaattacgcgcaggacctgcagctgtcgatacgcggcttcggcgcgcgctcg 
Accttcggcgtgcgcggcgtgcggctgtacgtggacggcatcccggccaccatgcccgac 
Ggccagggccagacctcgaacatcgacatcggctcggccggccgcgtggaagtgctgcgc 
Ggcccgttctcggccctgtacggcaattcgtcgggcggcgtggtgcaggtgttcaccgaa 
Cagggcagcgatccgcccgaggcgacgggcagcgcggcggcgggcagcttcggcacctgg 
Cgctacggcgccaagctgcgcggcgccagcgcggcagacggcctggattacgtgctggac 
Ttcaatcgcttcacgaccgagggctatcgcgaccacagcgccgcgcgcaagaacctgggc 
Aacgcgcggctgggcctgcgcatggacgacggcagccgcctgacgctgagcgccaaccac 
Gtggacctgaccgcgcaggatccgctgggcctgacgcgcgagcaattcgaggacgacccg 
Cgcagcgcgccggtggccgagcgcttcgatacgcgcaagaccgtgcgccagacccagggc 
Ggcctgctgtacgagcgcgccttcgacacgcgcaacgacctgcgcgtgatgctgtactac 
Ggacaacgccgcaccacgcaataccaatccatcccggtggccgtgcagcaaagccccacg 
Caggccggcggcgtgatcgacctgggccgcgactacggcggcgccgacctacgctggacc 
Tcgcgccagcaggtggccggcctgccgctgaccctgatcggcggactggcctatgacacc 
Atgaaggagcagcgccgcggctacgacaactacaccggcccgcccgctgcgcccaccggc 
CatgggcgtcaagggcgcgttgcggcgcgacgagaccaacacggtctacaacctggaccG 
Gtacctgcaggcctcgtggcagttcgccgagcgctggacgctggacgcggggctgcgcta 
Cagcacggtgcgcttcgactcggacgatcattaccaggcgccgggcaacggcgacgacag 
Cggacgcgccacctatcgcaaggccttgccggtggcggcgctgcgctatgcggccaacga 
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gaacctgagcctgtacgcctcgtacggacgcggcttcgagacgcccacgctcaatga 
SEQ ID NO:28 polypeptide sequence of Orfl4 

^IOTLRRllRIIiG^VAATLGGPAAAQEAPAMIJEPVRI SGTRTGTSVIiDTPASVDWDGHEIjElA 

RNIjQVNIjSEGIjAG\n?GIiQIjQNRQlSryAQDIiQLS IRGFG^ PATMPD 

GQGQTSNIDIGSAGRVEVLRGPFSALYGNSSGGWQVFTEQGSDPPEATGSAAAaSFGTW 

RYGAKLRGASAADGLDYVIiDFJniFTTEGYRDHSAARKNLGNARIjGlJ^ 

VDIiTAQDPLGLTREQFEDDPRSAPVAERFDTRKTVRQTQGGIjIaYERAFDTRNDLRV^ 

GQRRTTQYQSIPVAVQQSPTQAGGVIDIiGRDYGGADrjRWTSRQQVAGLPIiTIjIGGIjAYDT 

MKEQRRGYDNYTGPPAAPTGHGRQGRVAARRDQHGLQPGPVPAGLVAVRRAIjDAGRGAAL 

QHGAIiRIiGRSLPGAGQRRRQRTRHIiSQGl4AGGGAAliCGQREPEPVRraVRTRIiRDAHAQ 

SEQ ID NO:29 polynucleotide sequence of OrflS 

Atgaacatgtctctgtcacgcattgtcaaggcggcgcccctgcgccgcaccacgctggcc 

Atggcgctgggcgcgctgggcgccgccccggcggcgcatgccgactggaacaaccagtcc 

Atcgtcaagaccggtgagcgccagcatggcatccatatccagggctccgacccgggcggc 

gtacggaccgccagcggaaccaccatcaaggtaagcggccgtcaggcccagggcatcctg 

ctagaaaatcccgcggccgagctgcagttccggaacggcagtgtcacgtcgtcgggacag 

ttgtccgacgatggcatccggcgctttctgggcaccgtcaccgtcaaggccggcaagctg 

gtcgccgatcacgccacgctggccaacgttggcgacacctgggacgacgacggcatcgcg 

ctctatgtggccggcgaacaggcccaggccagcatcgccgacagcaccctgcagggcgct 

g9cggcgtgcagatcgagcgcggcgccaatgtcacggtccaacgcagcgccatcgtcgac 

gggggcttgcatatcggcgccctgcagtcattgcagccggaagaccttccgcccagccgg 

gtggtgctgcgcgacaccaacgtgaccgccgtgcccgccagcggcgcgcccgcggcggtg 

tctgtgttgggggccagtgagcttacgctcgacggcgggcacatcaccggcgggcgggca 

gcgggggtggcggccatgcaaggggcggtcgtgcatctgcagcgcgcgacgatacggcgc 

ggggacgcgcctgccggcggtgcggttcccggcggtgcggttcccggtggtgcggttccc 

ggcggcttcggtcccggcggcttcggtcccgtcctcgacggctggtatggcgtggacgta 

tcgggct ccagcgtggagct cgcccagt cgat cgt cgaggcgccggagctgggcgccgca 

atccgggtgggccgcggcgccagggtgacggtgtcgggcggcagcttgtccgcaccgcac 

ggcaatgtcatcgagaccggcggcgcgcgtcgctttgcgcctcaagccgcgcccctgtcg 

atcaccttgcaggccggcgcgcatgcccaggggaaagcgctgctgtaccgggtcctgccg 

gagcccgtgaagctgacgctgaccgggggcgccgatgcgcagggcgacatcgtcgcgacg 

gagctgccctccattcccggcacgtcgatcgggccgctcgacgtggcgctggccagccag 

gcccgatggacgggcgctacccgcgcggtcgactcgctgtccatcgacaacgccacctgg 

gtcatgacggacaactcgaacgtcggtgcgctacggctggccagcgacggcagcgtcgat 

ttccagcagccggccgaagctgggcggttcaaggtcctgacggtcaatacgctggcgggt 

tcggggctgttccgcatgaatgtcttcgcggacctggggctgagcgacaagctggtcgtc 

atgcaggacgccagcggccagcacaggctgtgggtccgcaacagcggcagcgagccggcc 

agcgccaacaccctgctgctggtgcagacgccactaggcagcgcggcgacctttaccctt 

gccaacaaggacggcaaggtcgatatcggtacctatcgctatcgattggccgccaacggc 

aatgggcagtggagcctggtgggcgcgaaggcgccgccggcgcccaagcccgcgccgcag 

ccgggtccccagccgccgcagccgccgcagccgcagccggaagcgccggcgccgcaaccg 

ccggcgggcagggagttgtccgccgccgccaacgcggcggtcaacacgggtggggtgggc 

ctggccagcacgctctggtacgccgaaagcaatgcgttgtccaagcgcctgggcgagttg 

cgcctgaatccggacgccggcggcgcctggggccgcggcttcgcgcaacgccagcagctg 

gacaaccgcgccgggcggcgcttcgaccagaaggtggccggcttcgagctgggcgccgac 

cacgcggtggcggtggccggcggacgctggcacctgggcgggctggccggctatacgcgc 

ggcgaccgcggcttcaccggcgacggcggcggccacaccgacagcgtgcatgtcgggggc 

tatgccacatatatcgccgacagcggtttctacctggacgcgacgctgcgcgccagccgc 

ctggagaatgacttcaaggtggcgggcagcgacgggtacgcggtcaagggcaagtaccgc 

acccatggggtgggcgcctcgctcgaggcgggccggcgctttacccatgccgacggctgg 

ttcctcgagccgcaggccgagctggcggtattccgggccggcggcggtgcgtaccgcgcg 

gccaacggcctgcgggtgcgcgacgaaggcggcagctcggtgctgggtcgcctgggcctg 

gaggtcggcaagcgcatcgaactggcaggcggcaggcaggtgcagccatacatcaaggcc 

agcgtgctgcaggagttcgacggcgcgggtacggtacacaccaacggcatcgcgcaccgc 

accgaactgcgcggcacgcgcgccgaactgggcctgggcatggccgccgcgctgggccgc 

ggccacagcctgtatgcctcgtacgagtactccaagggcccgaagctggccatgccgtgg 
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accttccacgcgggctaccggtacagctggtaa 
SEQ ID NO:30 polypeptide sequence of OrflS 

MmSLSRIVKAAPIiRRTTIiZVMALGAIiGAAPAA^^ 

WTASGTTIKVSGRQAQGILLENPAAELQPRNGSVTSSGQLSDDQIRRFLGTVTVKAGKL 

VMHATLANVGDTTODDGIALYVAGEQAQASIADSTLQGAGGVQIE^ 

GGLHlGAIiQSIiQPEI>LPPSRVVLRDTNVTAVPASGAPAAVSVIiGASEIjTI^ 

AGVAAMQGAWHLQRATI RRGDAPAGGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYGVDV 

SGSSVEIiAQSIVEAPEIiGAAIRVGRGARVWSGGSLSAPHGNVIETGGARRFAPQi^ 

ITLQAGAHAQGKALiLYRVLPEPVKIiTLTGGADAQGDIVATELPSIPGTSIGPIiDVAIiM 

ARWTGATRAVDSIiSIDNATWVMTDNSNVGALRIJ^IXSSVDFQQPAE^ 

SGLFRIWVFADLGLSDKIiVVMQDASGQHRLWVRNSGSEPASANTLLLVQTPLGSAATFTL 

ANKDGKVDIGTYRYRIjAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQP 

PAGRELSAAANAAVNTGGVGIiASTIiWYAESNALSKRLGELRIiN 

DNRAGRRFDQK^GFEIiGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTDSVHVGG 

YATYIADSGFyiiDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRPTHADGW 

PLEPQAEIiAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIEIiAGGRQVQPYIKA 

SVLQEFDGAGTVHTNGIAHRTELRGTRAEIiGIiC^^AAAIjGRGHSLYASYEYSKGPK^^ 

TFHAGYRYSW 

SEQ ID NO:31 polypeptide sequence of Orfl6 

Atggcaggacaagcgaggggatggtacggcgcaggcggacgccacccaatacattttcaa 
Atttcggcgggcgctgcgttgatgctgggcctgctggacgtcgccggcgccgccgctgtc 
Acggcagcgcagcgaatagatggcggcgcggcgtttctgggcgatgtcgccatagcgacg 
Accaaggcgtccgagcacggtatcaacgtgactggccgcacggcagaggttcgggtgacg 
Ggcggcaccatacggacgagcggcaaccaggcccagggcttgcgggtcggcacggagaat 
Gcaccggacaacaccgcgctgggcgcgtcggtctttttgcagaacctgatcatcgagact 
Tccgggaccggggcattgggcgtctctgtccacgagccacagggaggaggaggcacgcgc 
ttgtccatgtccgggacgacggtgcgcacgcgcggcgatgacagtttcgccctgcagctt 
tcagggcctgccagcgccaccttgaatgacgtggcgctggagacggccggccagcaggcg 
cccgcggtggtgctgtggcaaggcgcacagttgaacgcacaggggctggtggttcaggtc 
^acggggcaggcgtttccgcgatacatgcgcaggatgccggcagcttcacgttgtcgggc 
tcggatattaccgcccggggcctggaagtcgccgggatctatgtgcaggaaggcatgcag 
gggacgttgacgggtacgcgggtcacgacgcagggcgataccgcgcccgccttgcaggtg 
gaggacgcgggtacgcacgtcagcatgaacggcggcgcgttgtcgacctccggcgcgaat 
tcgcccgctgcatggctgctggctggcggttccgcgcagttccgcgatacggtattgagg 
accgtcggcgaggcctcgcatggcgtggacgtcgctgcgcacagcgaggtcgaactggcg 
catgcgcaggtgcgggccgacgggcaaggggctcatggcctggtggtgacgcgaagcagc 
gcgatggtgcgggcgggttcactggtagagagcaccggagacggcgccgcggcgctgctg 
gaaagcgggcatcttacggtggacggcagcgtggtccatggccacggcgcggccgggttg 
gaggtcgacggcgagagtaatgtgtccctgctcaacggcgcacgcctgtcgtcggaccag 
ccgacggcgatcaggctgatcgaccctcggtcggtcctgaacctcgacatcaaggaccgg 
gcgcagctattgggcgacattgcgccagaggcgcagcagccggacggttcgcccgagcag 
gccagggttcgtgtggcgctcgccgacggggggacgtgggcgggccgcacggacggcgcg 
gtccatacggtgcgattgctcgatcgtggcgtctggaccgtgacgggcgattcccgggtg 
gccgaggtcaagctggagggcggcacgctggcgtttgcgccacctgcgcagcccaagggc 
gctttcaagacactggtcgcgacgcagggcatttccggtacgggcacgatagtcatgaat 
gcacatttgcccagcggcacggccgatgtgctggtggcgccgcagggattcggcgaccgg 
caggtgctggtggtcaacaacacggatgatggcaccgagagcggcgcgaccaaggtgccg 
ctgatcgaagacgaacaaggccatacggcgttcacgctgggcaacatggggggacgggtg 
gacgcgggtgcgcgccagtacgaattgaccgcgagcgaggcgcaggccgacaaggcccgc 
acctggcagctgacgccgaccaacgagttgtccaccacggcgaccgccgccgtgaatgcg 
atggcgatcgcggcgtcgcagcgcatctggcaggccgaaatggacgtgttgctgcgccat 
atgagcggcct gcat t cga t cgggt cgccgggcggat t ct gggcgcgcggcctgagccag 
cgccagaggct cgat accggtt acggaccctggcagaagcagaccgt cagcggaat agag 
ctgggcctcgacaggcgggtggccggcggcgcaacgacggcgtggtccgtcggcatgctg 
gccggctacagcgagacccggcgcgatggcggcgcataccgcgccgggcatgtgcacagc 
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gcgcacgtcggcgcgtatgtctcctacctgaatgattcgggctcgtatgtggatggcgtg 
gtcaagtacaaccgctttcggcatggtttcgacattcgcacgaccgacctgaagcgggtc 
gatgccaagcaccgcagccacggcctgggcgcgttgctgcgcggcgggcgccgtatcgat 
atcgatggcggctggtatgtcgagccgcaggcttcggtggcgtggttccacgccggcggg 
agccgctatgaggccagcaatggcctgcgcgtgcgcgccgacggcgcgcattcatgggtg 
ttgcgcgccggggcggaggcgggccggcagatgaggttggccaatggcaatatcgttgaa 
ccctatgcgcgcttgggctgggcccaggagctgggggccgataacgcggtctacaccaac 
ggcatcaggcatgtcacgcgttcgcgtggcggattcgccgaggcccgcgtgggggtgggc 
gccttgctgggcaagcggcatgccttgtacgccgactacgagtatgccaagggcgcgcgg 
ttcgaggcgccctggaccttgcagctggggtatcgctacagctggtga 

SEQ ID N0232 polypeptide sequence of Orfl6 

MAGQARGWYGAGGRHPIHFQISAGAAiMIiGliLiDVAGAAAVTAAQRiDGGAAFL 

TKASEHGINVTGRTAEVRVTGGTIRTSGNQAQGLRVGTENAPDNTAIiGASVFLQNIillET 

SGTGALGVSVHEPQGGGGTRLSMSGTTVRTRGDDSFALQLSGPASATIJSro 

PAVVLWQGAQIiNAQGIiWQVNGAGVSAIHAQDAGSFTIjSGSDITARGIiEVAGIYVQEGMQ 

GTLTGTRVTTQGDTAPALQVEDAGTHVSMNGGALSTSGANSPAAWLIiAGGSAQPRDTVIjR 

TVGBASHGVDVAAHSEVELAHAQVRAIX3QGAHGLVVTRSSAMVRAGSI)VESTGD(^^ 

ESGHIiTVDGSVVHGHGAAGLEVDGESNVSIjLNGARLSSDQPTAIRLIDPRSVLI^ 

AQLLGDIAPEAQQPDGSPBQARVRVALADGGTWAGRTIXSAVHTV^^ 

AEVKLEGGTIiAPAPPAQPRSAPKTLVATQGISGTGTIVMNAHLPSGTADV^ 

QVLVVNOTDDGTESGATKVPLIEDEQGHTAFTLGKnyiGGRVDAGARQyELTASEAQJ^ 

TWQLTPTNELSTTATAAWAMAIAASQRIWQAEMDVIiliRHI^ 

RQRLDTGYGPWQKQTVSGIELGLDRRVAGGATTAWSVC^dTjAGYSETRRDGGAYRAGHVHS 

AHVGAYVSYIJroSGSTVDGVVKYNRFRHGFDIRTTDIjKRVDAK^ 

IDGGWYVEPQASVAWFHAGGSRYEASNGLRVRADGAHSWVLRAGAEAGRQMRLANGN 

PYARLGWAQEIjGADNAVYTNGIRHVTRSRGGFAEARVGVGi^LIiGKRH^ 

PEAPWTLQIiGYRYSW 

SEQ ID NO:33 polynucleotide sequence of Orfl7 

Atgtatctcgatagattccgtcaatgtccgtcttccttgcagatcccgcgttccgcgtgg 
Cgcctgcatgcgctggccgcagctctggcgctggccggcatggcccggctggcgcccgcg 
Gcggcgcaggcgccgcagccgcccgtggccggcgcgccgcatgcgcaggacgccgggcag 
Gaaggagagttcgaccaccgggacaacacgctcattgcagtctttgacgacggcgtcggc 
Atcaatctcgacgacgatcccgacgagctcggcgagacggcgccccccacgctcaaggac 
Atccacatctcggtggagcacaagaacccgatgagcaagccggccatcggggtgcgtgtc 
Agcggcgccggccgcgcgctgacgctggccggctcgaccatcgatgccaccgagggcggc 
Attcccgcagtggtacggcgcggcggcacgctggagctggatggcgtcaccgtggcgggc 
Ggggaagggatggagccgatgacggtctctgacgccggcagccgcctgagcgtgcgcggc 
Ggcgtgctgggcggcgaagcgccgggcgtcggcctggtccgggccgcgcaaggcggccag 
Gcgagcatcatcgacgcgacgctgcagagcatcctcgggcccgcgctcattgccgacggc 
Ggctccatttccgtcgccggcggttcgatcgacatggacatgggcccgggattcccgccg 
Ccgcctccaccgcttcccggggcgccgctggccgcgcatccgccgctcgatcgcgttgcc 
Gcggtgcacgccggccaggacggcaaggtgacactgcgggaggtggcgctgcgggctcac 
Gggccgcaggcgacgggcgtctatgcgtatatgcctggcagcgaaatcaccctgcaggga 
Ggcacagtcagcgtgcagggcgatgacggggccggcgtggtcgccggcgcgggcctgctc 
Gacgccttgccgcccggcggcacggtgcggctggacggaaccacggtgtcgaccgatggc 
Gccaacaccgatgccgtgctggttcgcggcgacgcggcgcgcgccgaggtcgtcaacacc 
Gtgct^cgcaccgccaagagcctggccgccggcgtatcggcccagcatggaggccgcgtc 
Acgctgcggcagacccgcatcgagaccgcgggcgcgggggccgagggcatctccgtgctg 
Ggcttcgagccgcagtccggctccggcccggccagcgtcgacatgcagggcggcagcatc 
Accacgaccggcaaccgcgccgccggcatcgcgctcacccacggcagcgcccgcctggaa 
Ggcgtggcggtgcgcgccgagggcagcggctcgagcgccgcgcagctggccaacggcacg 
C t ggt cgt c agcgcagggt cgc t ggcc t cggcc cagt c cggcgcga t cagcgt gaccgac 
Acgccgctgaagctgatgccgggcgccctggccagcagcacggtctcggtccggttgacc 
Gacggcgccacggcgcaaggcggcaatggcgtgttcctccagcagcattccaccattccg 
Gtggcggttgccctcgagagcggcgccctggctcgcggcgatatcgtcgccgacggcaac 
Aagcccctcgatgccgggatctccctcagcgtggccagcggcgccgcctggcacggcgcc 
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Acccaggtgctccagtcggccacgctgggcaagggcggaacctgggtcgtgaacgccgac 
Tcccgggtgcaggacatgtcgatgcgcggcgggcgggtcgaattccaggcgcccgcgccc 
Gaggcctcttacaagaccctgaccctgcaaaccctggacggcaacggcgtgttcgtgctg 
Aacaccaacgtcgccgccgggcagaacgaccagttgcgggtcaccggccgcgccgatggc 
Cagcaccgcgtgctggtgcgcaatgccggaggcgaggccgacagccggggcgcccgcctg 
Ggcctggtgcatacccaggggcagggcaacgccaccttccggctggccaacgtcggcaag 
Gcggttgacctgggcacgtggcgctacagcctggcggaggatccgaagacgcatgtctgg 
Agcttgcagcgcgcgggccaggccctgtcgggggcggccaatgccgccgtgaacgcggcg 
Gatctttccagcatcgccctggccgagtccaacgcgctggacaagcgcctgggcgagctg 
Cgcctgcgcgccgacgccggcgggccatgggcgcgtacgttcagcgagcgccagcagatc 
Agcaaccgccacgcccgcgcctacgaccagacggtcagcgggctggagatcggcctggac 
Cgtggctggagcgcgtcgggcgggcgctggtacgccggcggcctgctcggctacacctat 
Gccgaccgcacctatcccggcgacggtggcggcaaggtcaagggcctgcacgtcggcggc 
Tacgccgcctatgtcggcgatggcggctactatctcgacaccgtgctgcggctgggccgc 
Tacgatcagcaatacaacattgccggcaccgatggcggccgcgtcaccgccgactaccgc 
Acaagcggcgccgcatggtcgctcgaaggcgggcgccggttcgagctgcccaacgactgg 
Ttcgccgaaccgcaggccgaggtcatgctgtggcgcacgtcaggcaagcgctatcgcgcc 
Agcaatggcctgcgcgtcaaggtggacgccaacaccgccacgctgggccgcctgggcttg 
Cgcttcggccgccgcatcgccctggccggcggcaacatcgtgcagccctacgccaggctc 
Ggctggacgcaggagttcaaaagcacgggcgatgtgcgcaccaatggcafctggccatgcc 
Ggcgcaggccgccacggccgcgtggaactgggcgcgggcgtcgacgccgcgttgggcaag 
Gggcacaacctctatgcttcgtacgagtacgcggcgggcgaccggatcaacattccgtgg 
tcgttccacgccggctaccgctacagcttctga 

SEQ ID NO:34 polypeptide sequence of Orfl? 

^raJDRFRQCPS SLQI PRS AWRLHALAAAIiALAGMARI^ 

EGEFDHRDOTIjIAVFDIX3VGINIJ:^DDPDBriGETAPPTLKI5IHISVEHKNPMSKPA 

SGAGRALTIiAGSTIDATEGGIPAVVRRGGTLEIiIXSVTVAGGEGMEPMTVSDAGSRIjSVRG 

GVLGGEAPGVGLVRAAQGGQASIIDATLQSILGPALIADGGSISVAGGSIDMDMGPGFPP 

PPPPrlPGAPIAAHPPrlDRVAAVHAGQDGKOTLRErVALRAHGPQATGVYAY^^ 

GWSVQGDDGAGVVAGAGLLDALPPGGTVRLDGTWSTDGAim)AVliVRG 

VLRTAKSIjAAGVSAQHGGRVTLRQTRIETAGAGAEGISVIiGFEPQSGSGPASVDMQGGSI 

TTTGNRAAGIALTHGSARIiEGVAVRAEGSGSSAAQIjANGTLWSAGSIiASAQSGAISV^ 

TPLKLMPGAIASSTVSVRLTDGATAQGGNGVFLQQHSTIPVAVALESGALARGDIVADGN 

KPLDAGISLSVASGAAWHGATQVLQSATLGKGGTWVVNADSRVQDMSMRGGRVEF 

EASYKTLTLQTrlDGNGVFVtJSFTNVAAGQ^^ 

GLVHTQGQGNATFRIiAl!TVGKAVDIiGTmYSIiAEDPKTHW 

DLSSIALAESNAIiDKRIiGELRIiRADAGGPWARTFSERQQIS^nyiARAYDQTVSGIjEIGIjD 
RGWSASGGRWYAGGLIiGYTYADRTYPGDGGGKA^KGIiHVGGYAAYVGDGGYYLD^^ 
YDQQYNIAGTDGGRVTADYRTSGAAWSIiEGGRRFELPNDWFAEPQAEVMLWRTSGKR^^ 
SNGLRVKVDANTATIiGRLGLRFGRRIAIiAGGNIVQPYARLGWTQEFKSTGDVRT^^ 
GAGRHGRVEIiGAGVDAAIiGKGHISIIjYAS YEYAAGDRINI PWSFHAGYRYSP 

SEQ ID NO:35 polynucleotide sequence of Orfl8 

Atgcacatttacggaaatatgaatcgagcaacaccttgccgcggtgcggtgcgtgccctt 
Gcgcttgccctgctgggagcgggtatgtggacactttctcctccctcggcatgggcgctt 
Aagctcccgtcgctgctgacggacgacgagctgaagctggttctgccgactggcatgtct 
ctggaggatttcaagcgcagccttcaggagtccgcgccgagcgcgctggcaacgccgccg 
tcgtcttcgcctccggttgcgaagccaggtccgggctccgttgccgaggctccgtcgggg 
tcgggccacaaggacaacccatcccctcccgtcgtcggcgtcggtccaggtatggcggag 
tcgtctggcggacataacGccggcgtgggggggggcacgcatgaaaatgggttgcccggt 
ataggaaaggtcggcgggtctgcgcccggaccggataccagtacgggctcgggtcccgac 
gccggcatggcgtccggagcgggttcgacgtcgcccggcgcatcgggtggggcgggcaag 
gatgcgatgccgccctcggaaggcgagaggccggactccggtatgtccgattcggggcgg 
ggtggcgaatcgtcggctggaggcttgaatccggacggcgctggcaagccaccgcgggag 
gaaggcgagccgggttccaagtctcctgcggacggtggccaggacgggccgccgccgccc 
cgggacggcggcgatgcggatccgcaacctccgcgggacgatggcaatggggaacagcag 
ccgcccaagggcggcggggatgaggggcagcgcccgccgcctgccgccggaaacggtggc 
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aacggtggcaacgggaacgcgcagcttcccgagcgcggcgacgacgcgggtccgaagcct 
cccgagggagagggcggcgatgaaggtccgcaaccgccgcagggcggcggcgagcaggac 
gcgccggaggttcctcccgtcgcgccggcgccgcccgcgggcaacggtgtctatgacccg 
ggcacgcataccttgaccacgccggcctctgcggcggtgagcctggccagcagttcgcat 
ggcgtatggcaggccgagatgaacgcgttgagcaagcgcatgggcgagttgcgcctgacg 
ccggttgcgggcggcgtatggggccgcgcttttggccggcgccaggacgtcgacaaccgc 
gtgtcgcgcgagttccgccagaccatcagcggtttcgaactgggcgccgataccgccttg 
ccggtggccgacgggcgctggcacgtgggcgcggtggctggctacaccaacggccgcatc 
aagttcgaccggggcggcacgggcgatgacgacagcgtgcacgtgggcgcttacgctacc 
tacatcgaggacggcggtttctatatggatggcatcgtgcgggtcagccgcattcgccac 
gcgttcaaggtggacgacgccaagggccggcgcgtgcgcggccagtaccgcggcaatggc 
gtgggcgcgtcgctggaactgggcaagcgcttcacgtggcccggcgcctggtacgtggag 
ccgcagctggaggtggccgccttccatgcgcaaggggccgactacaccgccagcaacggc 
ctgcgcatcaaggacgacggcacgaactccatgctgggccgcctgggcctgcacgtgggg 
cggcagttcgacctgggcgatggccgcgtggtgcagccctacatgaagctgagctgggtg 
caggagttcgacggcaagggcacggtgcgcaccaacgacatccggcacaaggtgcggctc 
gatggcggccgcaccgaactggccgtaggggtggcttcgcaactgggcaagcacggcagc 
ctgttcggctcgtacgagtacgccaagggcagccgccagaccatgccgtggaccttccac 
g t cggc t at cgc t acgc c t ggt ag 

SEQ ID NO:36 polypeptide sequence of OrflS 

^fflIYGNMlS^RATPCRGAVRALALALLGAGMWa^ 

LEDFKRSIjQESAPSAIATPPSSSPPVAKPGPGSVAEAJPSGSGHKBNPSPPWGVGPGM/^ 

SSGGHNPGVGGGTHENGLPGIGKVGGSAPGPDTSTGSGPDAGMASGAGSTSPGASGGAGK 

DAMPPSEGERPDSGMSDSGRGGESSAGGLNPDGAGKPPREEGEPGSKSPADGGQDGPPPP 

RDGGDADPQPPRDDGNGEQQPPKGGGDEGQRPPPAAGNGGNGGNGNAQLPERGDDAGPKP 

PEGEGGDEGPQPPQGGGEQDAPEVPPVAPAPPAGNGVYDPGTHTLTTPASAAVSXiASSSH 

GWQAEMNALSKRMGELRLTPVAGGVWGRAFGRRQDVDNRVSREFRQTISGPELGADTAL 

PVADGRWHVGAVAGYTNGRI KFDRGGTGDDDSVHVGAYATYIEDGGFYMDGI VRVSRIRH 

AFKVDDAKGRRVRGQYRGNGVGASIiEIiGKRFTWPGAWYVEPQLEVAAFHAQGADYTAS^^ 

LRIKDDGTNSMIiGRLGIiHVGRQFDLGDGRWQPYMiaiSWQEFDGKGTVRTNDira 

DGGRTELAVGVASQLGKHGSIjFGSYEYAKGSRQTMPWTPHVGYRYAW 

SEQ ID NO:37 polynucleotide sequence of Orfl9 

Atgaaaccgacttccatcctggcacgtttgccccgctatctcggcgcctgcgcgctggcc 
Gcgctggccgcgctggctgtcgcgccgctcgcgccggcgcaggcacagactccgctgccc 
Gcgggactcggcgccgccgaggtgcggcagtatttgtccggcctgccgtccgatgccctg 
cgccagcaggcgtcgtggctggcgccggcgctgttgcgcccctatctgtcaggcctgacg 
gatgcgcaattgcggcaatatgtgcaggcgctgacacccgggcagatcacgcaggggctg 
gcggcgttgacgcctgcgcagcgtgcgcggctgcagcgcgaattcgaacggcaggcgcgc 
cggcaggtgcagcaggcggtacgggccgaggtcgccgcgcgcagcgcgcgggcggtggcg 
atggggcagagcgcatcgatgctgctgctcgacgccgagatgggaaccctggcgcaacgc 
cagggcgatctgcgccgcggccacgacgagggcgccttctgggcgcgcggcagcgcgaac 
cgcttcaaggtcgatacgccggacacaccggcgttcgacctgcgcgtggagtacctgacg 
ctSFSrSrcgccgaccatggctggcgcctggacacggggcggctctatctgggcgcctacgcc 
ggcgtctcgcgcgcccgcatggatgacaacgacatcatgcacggccggatcgaaagccgg 
ttcctgggcacgtacctgacttatgtggacaacggcgggttctacgtcgatgcggtcagc 
aagctggggcgtatcgacgagtccgtgtcgttcgacctgccgctggggctgggcgactac 
gacgacgatatatcgcatacaacgtatacgggcagtgccgaggccggctatcacttcaag 
ttgccgcaacgctggttcgtcgagccgcaggcgcaggtgatctactcgcgcagcagccag 
acgtcggtgcaggggcgggccggcgtgcgcgccggccgggatttcaccctggccggcggc 
gcgaccttgcgtccttatgtcagcgcctcgtacctgcacgagttctcgcacgacgactcg 
gt cgatttcggcggcaagtcgtacgatgccgaactgcc cggc age cgctggo age tgggt 
gccggcgcggcgctggacgtgggggcgcatcgcgcctacgcggatctgcgctatgggcac 
ggcgccaacatcagccaggacctgtcgctgaacatcggctacgcgtaccgcttctag 

SEQ ID NO:38 polypeptide sequence of Orfl9 
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MKPTSILARLPRYIjGAC^UIiAALAAIjAVAPIi^ 

RQQASWIiM^ALLRPYLSGIiTDAQLRQYVQALTPGQITQGLAAIiTPAQRARIjQREFERQM 
RQVQQAVRAEVAARSARAVAMGQSASMIiIjIjDAEMGTLAQRQGDIiRRGm 
RPKVOTPDTPAFDIiRVEYLTLGADHGWRLDTGRLyi^YAGVSR;^^ ESR 
PIjGTyLTSrVDNGGFYVDAVSKLGRIDBSVSPDX.PLGIjGDYDDDI SHTTYT6S AEAGYHPK 
LPQRWPVEPQAQVIYSRSSQTSVQGRAGVRAGRDFTIiAGGATLRPYVSASYIiHEFSHDDS 
VDFGGKSyDAELPGSRWQIiGAGAALDVGAHRAYADI^YGHGANISQDLSIiNIGYAYRF 

SEQ ID NO:39 polynucleotide sequence of Orf20 

AtggtcggcaggagttgtcatcgtgcagggtggttataccgggcgaccttcctcTtatac 

GccgcaaactgtgcttatcccgcgaacgcccaatcggtttctggttccggtcagGtgtcg 

Aacgggccgatcacgtcaccgcactgggtggtgggcggggaactgatcgtcggggatacg 

Ggcgccgggaccttgctcatcgaggccggcggtaccgtgctcaacgactgggcctatatc 

Ggcagtgacaatggcgctgtgggcaccctgacggtgtcgggccgcgacggcgccggggcc 

Gcgtcgacctggacgactgtcgacgatgtgtcgatcggcgttgcggcgggcagcaggggc 

Acgctcgaggtgctcggcggggccagggcgcaaagcggatggggcaccatcggcgtcgct 

Gcaggcagcgtcggaagcgtgaccgtgtccgggcccgggtcggtgtggaatatcgccacg 

Gtcaattcgttccagatcggctcgggcggcagcgggacgctgtggatcgaccagggcggc 

Gcagtgtatagcgggcagggcgtcatcggttggaaccccggcagcgacgggcacgtcacg 

Gtattgggtccggcaacggtatggaacccgctgaacaatatctatgtcggtctcggcggg 

Actggtgaactggatatccgggacggcgcggccgttgcgactgcagggtcgagcccgccg 

Ggcgccgcggcatcgatctacatcgggacgagcgcagggagcgccggcacggtaacggtg 

Tcgagcgcgacggccgtcacctcgacgctcacgtcgaccgaccgtatcgaaatcggctcg 

Gccggcgccggggtgctgactgtcgccaaaggcgggatggtgggcgtcgccagcgacgcc 

Tggatagccatcaccggcacgtcctccggaacgctgaacctgaccggcgatgccagcggc 

Cgaggcgtgctggaaacgggctcggtcatcaagggcgccggcaacgcgaccttcaacctg 

Gatggcggggtcctgcgcgccaatcgtgacgaggccaatttcctcaatggtttctccacg 

Caggcggtgggaagcggcggcgcctggttcgatacgaatgcccatgacgtgggcgttgtc 

Accgccttctcgggtacgtccagcttcaacaagctgggagccggcacgctgacgctgtca 

Ggcaacagcgccgcgttcacggggaacaccgatatccaggccggaacgctgcaggtggac 

Ggcgttcttggcgggccggtggatgtgctggcgggggcgcggttgaccggtaccgggcgc 

Gtcggtgcgacggccaacaagggcaccattgcgccgggcccgcgcagcggctttggcacc 

Ctgacgatcgccggggattacgcggcccagggcggcaacctggaaatccgtacgcagctt 

Ggcgccgacgactcgccgaccgacaggctggtgatcacgggcgccagcgctggcgtgaca 

Ccggtcacggtcgagaatatcggcggcacgggcgcctcgacccagcggggcatacaggtc 

Gtgcaggtcaatggcgcttcggcaggccggttcaacctcgccaacggcgattacgtcatc 

Gaggggcgtccggcgctggtggccggcgcctatggctatgtgctgcagcaggacgccgcc 

Gacggcgattggtatctgaaatcgtcgctgcccgaccctggggctccccaaggcgggggc 

GgtctgccgggcgccggggagcccgtgctttatcagcccggcgtgccggtctatgaagcG 

Tatgccaacacgctgctgcatctgagccggctttccaccttgcgccaacgggtcggcaat 

Cgcctttatgatccggcagatgtcggccgcaacggcgtatggagccgcgtcgagggctcc 

Gcgagccagctcgatccttccgcgtccacgactggcgaacgccaggacgtcgatagctgg 

Aaagtgcagttcggtgtcgaccgtatcctggccggcgggcaagagggctcccgcctggtg 

Ggcggattggcgctgcagtacggcaaggccgacacgcgcgtgtcgtcgatatacggcaat 

Ggcactgtcgacgccacggcctatggcctgaccccgacgctgacctggtacggcagggac 

Ggcgcctatgtcgatgcccaggcccaggcgatctggttcgacagcgacctgagttcacgg 

Ctggccggcaagctcaaggatggccggaaagcgcatggctatgggctgggtatcgaagcg 

Ggcaaggccttcggattgcgggaggggctggccctgatcccgcaggcgcaattgtcgtac 

Gcatcgacccgcttcgacagcttcgacgacagattcggcgcccgcgtcgaagacgataag 

Ggcgacagcctgcagggccgtctcggcatcgcgctggactacaagagcagctggcaagcg 

Ggcggcgcgaaccgggagtcgagtgtcttcggcatcgtcaatgtgaagcatgagttcctg 

Gatggcacgcgcgtgcgcgttgccggcgtgccggtaagcagccgcatggcgcgcacctgg 

Ggcagcgtgggagtgggggccgattacggttggggagaacgctacgccatttacggccag 

Gtggacgccgatgcagatttcgccggcagctacatcgtcaccgcgaccgcggggttcagg 

atgatgttctag 

SEQ ID NO:40 polypeptide sequence of OrCZO 

MVGRSCHRAGWLYRATFLLYAANCAYPANAQSVSGSGQVSNGPITSPHWVVGGEIiIVGDT 
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GAGTIiIiIEAGGTVIiNDWAYIGSDNGyW^GTLTVSGRTCAGAASTWTTTO 

TLEVLGGAiy^QSGWGTIGVAAGSVGSVWSGPGSVWNIATVNSFQIGSGGSGTLWIDQGG 

AVYSGQGVIGWNPGSDGHVTVIiGPATVWNPLNNIYVGLGGTC 

GAAASIYXGTSAGSAGTVTVSSATAVTSTLTSTDRIEIGSAGAGVLTYAKGGMVGVASDA 

WIAITGTSSGTimiTGDASGRGVLETGSVIKGAGNATPNIiIXK5VliR^ 

QAVGSGGAWFDTNAHDVGVVTAFSGTSSFNKLGAGTLTLSGNSAAPTGI^ 

GVLGGPVDVTiAGARLTGTGRVGATANKGTIAPGPRSGFGTLTIAGDYAAQGGNLEIRTQL 

GADDSPTDRIjVITGASAGVTPVTVENIGGTGASTQRGIQWQVNGASAGRFNIiANGD 

EGRPAIiVAGAYGYVLQQDAADGDT^miKSSLPDPGAPQGGGGLPGAGEPVLYQPGVPVYEA 

YANTLLHIiSRIjSTLRQRVGNRLYDPADVGRNGVWSRVEGSASQLDPSASTTGERQDVD 

KVQFGVDRIIJ^GGQEGSRLVGGLALQYGKADTRVSSIYGNGTVDATAYGIiTPTLTP^ 

GAYVDAQAQAIWFDSDLSSRIiAGKLKDGRKAHGYGIiGIEAGKAFGLREGIiAblPQAQIiSY 

ASTRFDSFDDRFGARVEDDKGDSLQGRIjGIAIiDYKSSWQAGGANRESSWGIVNVKHEFIi 

IDGTRVRVAGVPVSSRMARTWGSVGVGADY6WGERYAIYGQVDADADFAGSYIVTATAGFR 

MMF 

SEQ ID NO:41 polynucleotide sequence of Orf21 

Atgccgtcacccgatgccttgccgcacacgccgcctgcttcaggcggcgatcgcgtgatc 
Agcgggatcctgcagcaggacctcggcagttggctggcgccggatgccgcaaagcgcagc 
Ccctccgagcctggcaaggcggccgaaaaaatcggggtaatgccgaacgaggacctcggc 
Aagtggctggttccgggggcgcaaaagaacaatccgcccgagcctggcaagacgctggac 
GaaatccgTgcgggtctcgaaaaatgggtggcgcccgggtccaagccgcccgtcgaaccg 
Gatccggacaaggcgacgcaggcgtatcgcaaagacctcgataaatggctggcgcctccg 
Gccaagtccggcccgcccgaagcgccacccgtcgtccaacccgaagcgccgccgcaagcg 
Caacctgaggcgccgcctgtcgtgccgccgccggccgagccgccagcagctcgaccgccg 
Gccgttccgcccgcgcggccggccggcgacgcggtgtacgtgccgggcacgcgcacgctg 
Acgccgacggccaacgcggcggtgggcacggccagcgccgcgcaaggtctgtggcaggcc 
Gagatgaacgcgttgagcaagcgcatgggcgagttgcgcctgacgccggttgcgggcggc 
Gtatggggccgcgcttttggccggcgccaggacgtcgacaaccgcgtgtcgcgcgagttc 
Cgccagaccatcagcggtttcgaactgggcgccgataccgccttgccggtggccgacggg 
Cgctggcacgtgggcgcggtggctggctacaccaacggccgcatcaagttcgaccggggc 
Ggcacgggcgatgacgacagcgtgcacgtgggcgcttacgctacctacatcgaggacggc 
Ggtttctatatggatggcatcgtgcgggtcagccgcattcgccacgcgttcaaggtggac 
Gacgccaagggccggcgcgtgcgcggccagtaccgcggcaatggcgtgggcgcgtcgctg 
Gaactgggcaagcgcttcacgtggcccggcgcctggtacgtggagccgcagctggaggtg 
Gccgccttccatgcgcaaggggccgactacaccgccagcaacggcctgcgcatcaaggac 
Gacggcacgaactccatgctgggccgcctgggcctgcacgtggggcggcagttcgacctg 
Ggcgatggccgcgtggtgcagccctacatgaagctgagctgggtgcaggagttcgacggc 
Aagggcacggtgcgcaccaacgacatccggcacaaggtgcggctcgatggcggccgcacc 
Gaactggccgtaggggtggcttcgcaactgggcaagcacggcagcctgttcggctcgtac 
Gagtacgccaagggcagccgccagaccatgccgtggaccttccacgtcggctatcgctac 
gcctggtag 

SEQ ID NO:42 polypeptide sequence of Orf21 

MPSPDALPHTPPASGGDRVISGIIiQQDLGSWIjAPDAAKRSPSEPGKAAEKIGVMPNEDI^ 

KWLVPGAQKNNPPBPGKTLDEIRAGLEKWVAPGSKPPVEPDPDKATQAYRKDIjDK^^ 

AKSGPPEAPPWQPEAPPQAQPEAPPWPPPAEPPAARPPAVPPARPAGDAVYVPGTRTL 

TPTANAAVGTASAAQGLWQAEMNAIiSKRMGELRLTPVAGGVWGRAFGRRQDVDNRVSREP 

RQTI SGFELGADT/UjPVADGRWHVGAVAGYTNGRIKFDRGGTGDDDSVHVGAYATYI EDG 

GFYMDGIVRVSRIRHAFKVDDAKGRRWGQYRGNGVGASLELGKRFTWPGAWYVEPQIiEV 

AAFHAQGADYTASNGLRIKDDGTNSMLGRIiGLHVGRQFDLGDGRWQPYMKIiSWVQEFDG 

KGTWTKTOIRHKVRLDGGRTEIiAVGVASQIiGKHGSLFGSYEYAKGSRQTMP 

AW 

SEQ ID NO:43 polynucleotide sequence of Orf22 

Atgtgcgacacctgcagagatgatgatggcacctcgccttcgattcgcgtccaaggcggg 
Gttgttcagggcggcatgggtgcaaataacgtcgctgtggtggcaacagggtctggaaag 
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Gtcgcgatcgagaatgcggaactgctcggagccagcggcatgtacgccacgttcggcgcg 
caggtcgatatgaaaggcgggcgcattctggcgcacaacaccaatatcctgggaagccag 
ggttacgccgatggtccctatggcggcgtggtcgtgacagaggacggtcaagtcaacctg 
gagggcgccaaggtcagtgcaactggcctgggggccgccggcttgtggttgctgggcgac 
aaggacaccagcccgcgagccagcctgcgcaacaccgacgtccacggagaggtcgccgcc 
attgcgctggggttcaatggcgaggcgaacatctcgggcggcagcttgagcgtagaggat 
ggggccgtgctcaccaccctgacgcccgatgcagtcgagtattactacgactacgccttg 
tccatggagcatctgccagctgatgcgccgttgacgccggtccgcgtcacgctgtccgat 
ggcgcgcgcgccagcggagaaacgttgatcgcgcatggcgggttgttgcccatgacgctg 
cgcttgagcagcggggtcgacgcccgcggcgacatcgtcacgctgccgccttccgcgccg 
cccgattccgcggagcaaccggatgccgagccggaaccggatgccgagctggaaccggac 
gccgcggcgcagtcggacgccaaggcgaatgcgcgggtcatggcgcaggtagatggcggg 
gaacctgttgccgtgccgatcccggccccttcgcatcccgatgccccgatcgacgtgttc 
atcgacagcggtgcccaatggcggggcatgaccaagaccgtcaatgcgttgcgcatcgag 
gacggcacctggaccgtcaccgggtcgtccacggtgaacagcctgcacctgcaggcaggc 
aaggtggcgtacgcaacgcctgccgaaagcgacggagaattcaaacacctgcgggtcaag 
accctctcgggaagcggcctgttcgagatgaacgccagcgccgacctgagcgatggcgac 
ctgctggtcgtgtccgacgaggccagcgggcagcacaaggtgctggtgcgaggagccggc 
acggaacccaccggtgtggaaagcctgacgctggtcgagctgcccgagggcagccagacg 
aagttcacgcttgccaaccggggcggggtggtcgacgccggcgcgttccgctatcgcctg 
acgccggacaacggtgtctggggcctggaacggaccagccagctttcggccgtcgccaac 
gcggccttgaataccgggggcgtgggcgcggccagcagcatctggtatgcggaaggcaat 
gcgctctccaagcgcctgggcgagttgcggctcgatcccggcgcgggcggcttctggggg 
cgcacgttcgcccagaagcagcagctcgacaacaaggctggccgacgcttcgaccagaag 
gtgtacggtttcgagctgggggccgaccatgccatcgcaggacagcaagggcgctggcac 
gtgggcggcctgctgggctatacccgcgcaaggcgcagcttcatcgatgacggcgccggg 
cataccgacagcgcgcatatcggggcctacgcggcgtacgtggcggacaacggcttctat 
ttcgattcgaccctgcgcgccagccgcttcgagaacgacttcacggtaacggccaccgac 
gccgtttccgtacggggcaagtaccgggccaatggggtaggcgccaccttggaggccggc 
aaacgtttcacgttgcacgacggctggttcgtcgaacctcagtccgaggtgtcgctgttc 
catgccagcggcggaacctaccgtgccgcgaacaacctgtcggtcaaggacgaaggcggc 
acct ccgccgt gc t gcgcct gggct t ggcggccgggcgacgcat cgacct gggcaaggac 
cgcgtgatccagccctatgccaccctgagctggctgcaggaattcaaaggcgtcacgacc 
gttcgcaccaacgggtacgggctgcgcaccgacctgagcggtggccgggctgaattggcg 
ctgggcctggccgccgcgttggggcgcggccaccagctctacacttcgtacgagtacgcc 
aagggcaacaagctgaccttgccttggacgttccacctgggctatcgctacacctggtag 

SEQ ID NO:44 polypeptide sequence of Or£Z2 

Ma)TCRDDDGTSPSIRVQGGWQGGMGA]SnsrVAVVATGSGK\^ 

QVDMKGGRIIiAHNTNIIiGSQGYADGPYGGVVVTEDGQVNIiEGAKV^^ 

KiyrSPRASLRNTDVHGEVAAIAliGFNGEANISGGSLSVEIXSAVLTTriTPDA 

SMEHLPADAPLTPVRVTIiSDGARASGBTLIAHGGLLPMTLRLSSGVDJ^GDIVTIiPPSAP 

PDS AEQPDAEPEPDAELEPDiU^QSnAKANARVMAQVDGGBPVAVPX PAPSHPDAPIDVF 

IDSGAQWRGMTKTVNAIiRIEDGTVmnPGSSTVNSLHLQAGKVAYATPAESDGE 

TLSGSGLFEMNASADLSDGDLLVVSDBASGQHKVLVRGAGTEPTGVESIjTLVELPEGSQT 

KPTLAI^GGVVDAGAFRYIUiTPDNGVWGIjERTSQLSAVANAALOT'GGVQ^ 

ALSKRLGELRLDPGAGGFWGRTFAQKQQLDNKAGRRFDQKVYGFELGADHAIAGQQGRWH 

VGGLIiGYTRARRSFIDDGAGHTDSAHIGAYAAYVADNGFYFDSTIjRASRFENDFTVTATD 

AVSVRGKYRANGVGATLEAGKRPTLHDGWFVEPQSBVSLFHASGGTYRAAlSnJJ^ 

TSAVIjRLGIiAAGRRIDIiGKDRVIQPYATIiSWLQEFKGVTTVRTNGYGIiRTO 

LGIiAAT^RGHQLYTSYEYAKGNKLTLPVJTPHIjGYRYTW 

SEQ ID NO:45 polynucleotide sequence of Orf23 

Tt gcgc cagaccacgccggt gccggt gcggct cgt cc t gcgcggcgcggcggt cgcgcag 
Ggcgatgtcgtgcgcgcgcccgagacggcgccggagaaggatgggttcggcacgcccgtg 
Cggccgggcttgcgcgtcgggctggaccaggcgccgctcgagctcgatgtggccgacggc 
gcgcagtggcatggcgcgactcagtcgcttgacaggctggccctgggcgcgggcggccaa 
tggcgcatgagcgcggcatccagcgtgggcgaactgagcatggagcctggcgcggccgtc 
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gtgttcggcgatgcggccggaccgggttttcaaacgctgacggtgcgcaccctggcgggc 
gccggttcgttcgagatgcgtgcggacgccgcgctggagcatgccgatcaactggtggtg 
accgaccaggccgaagggcggcatcgcgtgtggttgcgcgcgccggccggcgccgagccg 
tcgaaggcacaggccgtgctggtgcgcgcgcccgcagacggcaaggccagtttcgaactc 
gacggcagcgacggcagggccgacttcggcacctatcgctacgggctggcgcagcagccg 
ggcggcgcctggggcctagtcaggacggggtattcgtccaccgccgccgcggcgctggat 
accggcggactgggcgcggtgcaggggttgtggtatgccgaatccaacgcgttgggcaag 
cgcatgggcgaattgcgcctgaacccggacgccggcggcgcctggggccgggcgttcagc 
cagcgccagcgcatcagtccgcgcgcgggccggcatttccagcaaggcgtcagcggcatc 
gagctgggcgccgaccgggcctggcccgtggccggcggccgttggcatgcgggctggttg 
ctgggct acacgcgcgcgt cacgcgggt 1 1 1 ccggccagggaaaggggcacaccgacagc 
gtgcacgtgggcggctatgccacctatatcggcgccaatggcgtgtacgccgatgccacg 
ctgcgcgccagccgcttcgagaattcgttcgacgcacctggctgggcggggcgcaccgtg 
tccggcagctaccgcgccaatggcgtgggcgtgacgctggaggccggccggcgtctggcg 
ctggaccggcactggttcgtcgagccgcaggccgaactggcgtggtttcgtgccggcggc 
ggtacgtacacggccagcaatggcctgcgtatcgaggatgacggcggcacgtcgctgcag 
gcgcgggrtaggcgcgcaagccggccgccgcttcgacttgcgcggcggcgcggtggtgcag 
ccc t acgcgcagctgagt tgggtgcaggaact caagggcgtgagcacggt gcgcaccaac 
ggcatcgcgcaccgtaccgacctgggcgcgggccgcgtcgaactgggactgggcgtggcg 
gccgcgctgggcaagggccacaatctgtacgcgtcgtacgagtacgcgcacgggcccagg 
ctcagcctgccgtggaccgtgcagctgggttaccgctacgcttggtaa 

SEQ ID NO:46 polypeptide sequence of Orf23 

LRQTTPVPVRLVLRGAAVAQGDVVRAPETAPEKDGFGTPVRPGLRVGIiDQAPLEIjDVADG 

AQWHGATQSLDRIjALGAGGQWI^SAASSVGELSMEPGAAVVFGDAAG 

AGSPEMRADAALEHADQLVWDQAEGRHRVWLRAPAGAEPSKAQAVIiVRAPADGKASFEL 

DGSDGRADFGTYRYGIiAQQPGGAWGLVRTGYSSTAAAAIiDTGGIiGAVQGIiWYAESNAIiGK 

RMGBIiRLNPDAGGAWGRAFSQRQRI S PRAGRHFQQ6VSGI KXjGADRAWPVAGGRWHAGP^ 

IjGYTRASRGFSGQGKGHTDSVHVGGYATYIGANGVYADATIiRASRFENSFDAPGWAGRTV 

SGSYRANGVGNTTLEAGRRIjAIjDRHWFVEPQAEIAWFRAGGGTYTASNGIaRiro 

ARVGAQAGRRFDIiRGGAWQPYAQIiSWVQELKGVSTVRTNGIAHRTDLQAGRVEIiGLGVA 

AALGKGHNLYASYEYAHGPRLSIjPWTVQIiGYRYAW 

SEQ ID NO:47 polynucleotide sequence of Or£Z4 

Ttgccttcgccgcccgaagaggcgcctcaggcggggcccgacgcgtccaagcagcggccg 
Gagggcctgcccgcgcccgatgccaatccccaacccgatgcaaagcccggagctgagatg 
Aaaccccggcctggggtggaacccggacctgaggcggaacctggtccgcaggggcagcct 
ggcccccagcctggagcccggccgcaggacgagccgcacgcgcagccgctgccgcccgcc 
ggcaaccccggcgctgggatttacatgccccgcagcggcatcttgaccgcaccggttctg 
gccgtgctgggcacggccagtgcgccgcaaggtatctggcaggcagagatgaacgccctg 
agcaagcgcatgggtgaattgcggcttacgccggcagccggcggcgtgtgggcacgctcg 
ttcgcgcaacgccaacgcctggacaatcaggtggtggacaggttcacccagaccgtgggc 
gggatcgagattggcgccgacacggccttgccggcggccgaggggcgctggcatgtaggc 
gcggtggccggctacagccgtgcgcgccgcaagctggcgcacagcgcccgtggcaacagc 
gacagcctgcatgtgggcgcctatgcgacgtatatcggcgacggcggcttctacctcgac 
gggattgtgcgggtgaaccgctacgagcacgatttcagggctgacggccagcgcggcgcg 
cgcgtgacgggcaagtatcgcgccaatggcatcgggctgtcgctggagaccggcaggcgt 
- ttcaca trgggccggcgactggttcgtggaaccgcaggtcgaagtggcgttgttccgttcg 
95^^9M3-^=pSCtscacggccagcaatggcgtgcgcgtcgacgtggcaagcaccaagtcg 
ttgctgggccgggcaggcctgcaggtgggacgcaagctggatctgggcaacggcaaactg 
gtgcagccgtacgccaagctgagctggttgcaggagttcgatggcgtgggcaaggtgcgc 
accaacgatatcggccatgacgtcaaactgcggggcgggcgcgccgaactcgacttgggc 
gtggccgcggcgcttggcaggcacagcagcctgtttgcttcgtacgagtacagcaagggc 
agccgcttgaccattccgtggagctttcacgtcggctatcgatacgcctggtaa 

SEQ ID NO:48 polypeptide sequence of Ort24 
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LPSPPEEAPQAGPDASKQRPEGIjPAPDANPQPDAKPGABMKPRPGVEPGPEAEPGPQGQP 

GPQPGARPQDEPJiAQPLPPAGNPGAGIYMPRSGILTAPVIAVLGTASAPQGIWQAEI^^ 

SKRMGELRLTPi^GGVWARSFAQRQRLDNQVVDRPTOWGGIEIGM 

AVAGYSRARRKIiAHSARGNSDSLHVGAYATYIGIXK3FYIiIX3IVRV^ 

RVTGKYRANGIGLSIlETGRRFTWAGDWPVEPQVE^^PRSC^^ 

LLGRAGLQVGRKLDLGNGKLVQPYAKLSWLQEFDGVGKVRTNDIGHDVKI^ 

VAAAIiGRHSSIiFASYEYSKGSRLTIPWSFHVGYRYAW 

SEQ n> NO:49 polynucleotide sequence of Orf25 

Atgagacggttaaaggcccaggctttcgagggcagccgcagcaggccggcaggacatggg 
Gtggcgcctaccttgctggcgctggccctggggttccagggggcggcggcgtgggccaat 
Tgcacgacgtcaaacggtgctaccacttgcaccaacgccaacggctctcataccaacaag 
Gtgggcagtggaccgagcgggatgaacgaacgcgtcaccgtgaatcagggggcgcgcatc 
GAgacaaacgccagcgcggcgatcagtgtgggaacgagcgggcaggtacgaatcgagggc 
Ggtgcagtagtgcaaagcacggtcaatactgctgcgtccggccagtacgccaaaacgctg 
Gaagcagcaagcaataacaatatttccatccaagtaaacgcgcagctcctggccaagggc 
Agcgcttcgcagtccagcgcgttgggattgtcaggcgccggcaataccgtcaccaaccat 
Ggcacgatccgggccgataatgccgcggcaatctgggtcactgccaataccgccaatgcg 
Gccaataccatcgataactacgggactatcgaaacagtgctcaatggcggctacgccaac 
Gccatcggcagcacgcggaacaacagtgccacgggcgctggcgtgacggtacgcaatcat 
Gccaacggacgcatcgtcggcaacgtgaagttcgaggctggcgacgacagcgtcatactc 
Gacggcggctctaccatcaccggatccttgaacggtggcagcggcaacaacagcctgacg 
Ctgaaagccggcgacggcacgctgggccgcgcaatccgcaacttcggcacgatcaccaag 
Caggaggctggaacctggaccctgaatggccaggtcggccgcaacgacaacaacctcaag 
tccacggtcaaggtggagggcggcacgctggtcttgcgcggcgataacagcggcgccacc 
cagggcggcgtgttgcaggtgtccgccggcgctacggcggacgtaactgccgccagcgcc 
atgcagtccatcagcaacgccggcacggttcagttcacgcaggacagcaatgccgcctac 
gccggcgtgctgagcgggaccgggagcatcgtcaagcgcggcggcggcgacctgacgttg 
acgggcaacaacacccataccggcaaggtggtggtggaggcgggcagcctcagcgtatcg 
gcggccaacaacctgggtggcgcaggtagttcggtacagctcaagggcggcgccctcgcc 
ctcaagaaaaccatcgtcgtcaatcgcggcctgacgctcgattccggggcgcagacgttg 
atcatcgagccgggaacaaccacgacctggcaaggccaggttagcggcgccggcaaactg 
gtgacccagggcggcacgctggtgctggagcacgcgtccaatacgtatagcggcggtacg 
gagatcaacaacggaacgctgcgggcggcgcatgatgccagcctgggttccggcacgttg 
gcgctcaagaacagccagctggccgccacggacagcttcacggccacgcgtgcattgacg 
ctcgctggaaacgaaagcatagacgtcgcagccaccaagatactcagttggaacggcgaa 
atcagcggcgccggcaccctggtgaaggaaggccaggggaccttgctgctgcgcggaacc 
aatcagcaaaatggcggcacgaccgtcaatgccggtacgctgcagatatcccgcgacgcc 
aatcttggccgaggggcgctggcgctgaacgacggcacgctgcagagcaccggcagcttc 
gcgacctcgcgcgcggccaccttgcgcggccaggccaccatggaggtcgacgcttcgcat 
accgtgacctggaatggcgagctgagcggcggcggcatgttgcgcaagtcaggccagggc 
acgctggtcctggccggcgccaacacgtactcgggtggcacggtggtcgaggccggcgcg 
cttcgggcaggacacgaagacaacctgggacggggcgcaataaccctgcagggcggagat 
ctgcttgccggcggcagtttttcgagcaaccgcgatctcacgcttgtccgcggttccttg 
gacgtggctcgcgacgctaccctgacctggagcggcgcgatatcgggcgccggcgatctg 
gtcaagaaaggggacgggcgcctgacactcacgggcgtcaacgagtacgccggccagacc 
gtgctccggggcggcaagctgcgtgtggccagggacgaaaacctgggccgcggagcactg 
gtgctggaagacaataccgtgttcgagagcatgggctcgcatgccgccacgcggcaggtc 
acgctcaagggcgcgcccaaggtagagacgcttgacggcactacgctcgaatggcgcggc 
acggtcgacggcgacggcaagctgtacaagcaaggcggcggcacgctcgtgctgagcggc 
aacaatacctacgccaagggcgtcgaggtctggggcggggtcgtgcaagtctctcgcgac 
cagaacctgggcgcggccaatggcgcggtcacgctcaacggcggcgggttggcggccaac 
9999atttcacGagcaatcgccagctggagctgaccgccggggccaaggccatcgacgtc 
gcggccggcaaggacgtgacgtggcgcggtgtcgtcaacggcgccggcgcgctgaccaag 
gccggcgacggtacgctggcgctggccggcgccaacacctacaccggcggcacgcgcttg 
cagggcggcaccgtgcaggtatcgcgcgacaacaacctcggccaggccgccggcgcggtc 
acgttcgacggcgggcggctggccaacacgggcagctttgcgaccgcgcgcacggccacg 
ctcaacaaggctggccagatcgataccgaccggggcaccacgctgacatggaacggcgcc 
atcggcggcaagggcgagctgcgcaagcaaggggcgggcaccctggtgctgggcggagcc 
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Eiacacttaccagggcgacacccgcgtcgaggccggcacgctgcaagtgtcggccgacgcc 
aatctgggccagggcgccgtgcatctgcacgacagccggctggcgacgaccggtacgttc 
gcgacctcgcgccgtctggagttgaccggacgtggcgcggtgcaagcggctgccgccgcc 
acgctggattggcgcgggacggtcgctggcgccggcacgctggtcaaggagggcgcaggc 
acgctggtgctggccggtgacaaccagcatgccggcggcaccgaggtcagggccggcacg 
ctgcaggtatcgcgcgccaccaacctggggcccggcgcgctggcgctggagaacgcggcg 
Gtggccacgaccgccagcttcacggccacgcaggcagccaccctgactggcaacgccgcc 
atcgacacggccgccggcaccacgctgggatgggagggggccatcggcggaaccggcagc 
ctgcacaaaaagggcgagggcaagctggtgctggtcaaggacaaccaccatgacggcggc 
accacgatccacgccggtaccctgcaggtgtcgcgcgacgccaacctgggctcgggacag 
agcgcggtgacgctggatggcggcgccctggcggtttctgccgggttctccagcgggcgc 
gagatcgtcgtgggcgccgggcacggcgcgctttcggtgacgggcggccacaccctgcaa 
t ggcagggccaggt cggcggggcgggggcgt t gaccaagacgggcgacggcacgct cgt g 
ctggagcacgacaatacccacgccggcggtacccggattaccggcggggtgctgcgcgtc 
tcgcgcgatgagaacctgggcgaggcgcatggcatgctgacgctcgacggcggcacgctg 
tcgaccaccgccgggttcgcgagccggcgcaacgccaccgtgggcaacggcggcggccgg 
atcgtcgtcgccgacgccgccacgctggatttgcagggcgacgttgccggcgcgggccgg 
ctggtcaaagagggcgcgggcacgctggccttgggcggcacgaacacctatgccggcggc 
accgtggtcgaggccggcacgctgcgggtcgcgcgcgacgccaacctgggcggcggcgcg 
ctgaccctgaacaacagccgcctgcatgcgaccgccggctttgccaccggccgcgatgcg 
accctctccgggcgcgcctcgatcgacaccgacgaccgggcgacgctgcaatggcgcggc 
acggtcaatggcgccggcaggctggtcaagcagggcctgggcaccctggtactggacggc 
gacaaccggtacgcgggaggcaccgaggtcaatgccggcacgctgca.ggtcgcgcgcgac 
gccaacctgggcgcgggcgacgtggcgctcaatggcagcagcctggccgcgaccgccagc 
ttcgccaccgcgcgcacggccacgctgagcggcgcggccgccatcgacacggccgacggc 
gccacc 1 1 ggact ggaat ggcc t gc t cgacggt gacggcgccc t ggt caagcagggcaac 
ggcaccctggcgctggccgcggccaaccgctatggcggcggcaccatcgtcaaggcgggc 
gccgtgcggatcgcccgcgacgccaacctggggcgggccggcaccggcgtaacgctggac 
ggcggcgcgctggccaccacggcggatctcgcgaccggccgcgcggcgaccctgggcgcg 
gccaacggcacgctggacgtggccgccggcacccgcctggactggaacggggcgatcggc 
ggcgccggcgcgctgaccaagaccggcgccggcaccctggcgctcaaccacgacaaccag 
catgccggcggcaccctggtccatggcggcacgctgcggatcgcccgcgacgccaacctg 
ggcgcggcgggcacggcggtgacgctggacggcggcacgctggccaccacggcgtcgttg 
gcgcccgagcgcgcgctgcgcgtcggggcgcgcaacggcgtattgctgccggacgcgggc 
acgaccctggattggcggggcgtggtcgccggcgcgggcaagctgaccaaggccggtccg 
ggcacgctggtgctcagcgccgataaccgccatggcggcggcacggcagtcaccggcggt 
acgctgcaagtttcgcgcgacgccaacctgggcgcggcggccggcgcGCtgacgctggac 
Srgcggcacc 1 1 gc tgagcaccgccagc 1 1 tgcc t cggcgcgt oft cgccaccc t cgat gcc 
gcgggcggcaccttcgtcacccgcgacggcacccggctggattgggacggcgcgataggc 
ggggcgggtggcctggtcaaggaaggcgccggcgagctgcggcttggcaatgccaatacc 
taccaggggccgacccgcatcgccgccggccgcctggccgtcaacggcagcatcgccagc 
ccggtcacggtcgagcaggccggcgtgctgggcggcacgggccgcatcgtcggggatgtg 
gccaaccgcggcgtggtcgcgccgggcaactcgatcggcgcgttgacggtggccggcaat 
tacgctggtacgggcggcagcctggaagtggaggcggtgcttggcggcgacgccgcgccg 
gccgatcggctggtgctcgacggcggcgcggccagcggcgtcacgccggtcgtagtcaag 
ccgcagggcggggtgggcggcctgaccctgcgcggcattccggtggtcgtggcccagggt 
ggcgccacgaccgcgcccggggccttccgcctggcgcagccgctggtcgcgggcgcctac 
gagtaccagttgctgcgcggcgcgggcgacggcgccgcggcgcaggcgcaagactggtac 
cJtgcgtacctcccgcgtcgagcgcgacaaggcgggcaggatcgtcaaggtcgtgcccttc 
taccggcccgaggtggcgctgtatgccggcacgcccatgctgatgcgcatggtcggcacg 
gaagcgctgggcagctaccgcgaacgcgcgggccagcccggcgcggccgcgcccgaggca 
ggcgccgcagcccggcgtggcgtgtgggcacgtaccttcgggcgtcgtttcgagcgctcc 
3G3S3cagcgaagcggcgccgtccttcaacggcagcctggccggcatgcagctgggcgcg 
gacctctacacgcgtcgctcggccacccggcatgccgacgccttcggcgtgttcggcgga 
tacgccacggcccgcggcgatgtgcggggcctggcgcgcggcgagatccaggcggtgggc 
acgtccacgctgcgggccgcccagctgggcgcctactggacgcacactggtccgagcggc 
tggtacgtcgacacggtgctggcgggcacgcgctacaagcagcagaccagctcgtcggcc 
catgtcggcgcgaccagccgcggctggggcatgatggcctcggtggaggccggctacccg 
tggcagctcaafcccgcgctggcaaatcgagccgcaggcccagttggtgtatcagcagctt 
ggcatcgccaatggcgccgaccgcgtgtcttcggtgtcgtacaagacgcccgatgcgctg 
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acggggcggctgggcacgcgcctggcgggccagtacgcatacgggaaggcgcagttgcgg 
ccgttcatgggcgtatcgctgctgcacgatttcaccggcgccgacaccgtcacgttcgcg 
ggcgtgcacagcgtacgcgccagccgccagaacacggccgtggatctgaaggcgggcgtg 
gacacgcagctgggcaagagcgtaggcctgtgggggcaggtcggctacggcaagtcggtc 
ggcagcggcgacggcagcgaccgtggctggagcgccaacctggggctgcgcgtggcgtat 
tga 

SEQ ID NO:50 polypeptide sequence of Orf25 

MRRLKAQAPEGSRSRPAGHGVAPTLIiALALGFQGAAAWANCTTSNGAT^ 

VGSGPSGMNERVTVNQGARI ETNAS AAI S VGTS GQVR I EGGAWQSTVNTAASGQYAKTL 

EAASNNNI S IQVNAQLLAKGS ASQS STUjGIaSGAGNTVTNHGT IRADNAAAI WVTANTANA 

ANTIDmrGTIETVLNGGYANAIGSTRNNSATGAGVTVRl^^^ 

DGGSTITGSLNGGSGNlSISLTLKAGDGTLGRAIRNFGTITKQEAGTVn^IjNGQVG 

STVKVEGGTLVLRGDNSGATQGGVLQVSAGATADVTAASAMQSISNAGTVQFTQDSN?^ 

AGVLSGTGSIVKRGGGDLTLTGNNTHTGKVVVEAGSLSVSAANNLG(^ 

IjKKTIWNRGLTIjDSGAQTLI IEPGTTTTWQGQVSGAGKLVTQGGTLVLBHASNTYSGGT 

EINNGTLRAAHDASLGSGTIlALKNSQIlAATDSFTATRALTIlAG^^BSID 

ISGAGTXiVKEGQGTLLLRGTNQQNGGTTVSTAGTLQISRDANIiGRGALAI^ 

ATSRAATIjRGQATMEVDASHTVTWNGELSGGGMLRKSGQGTLVIAQAlSn^ 

liRAGHEDNLGRGAlTLQGGDLIiAGGSFSSNEODLTIjWGSIiDVT^ATIiTWS SGAGDl, 

VKKGDGRIiTLTGVNEYAGQTVIiRGGKriRVARDEl^GRGALVLEDNT^ 

TLKGAPKVETLDGTTLEWRGTVDGDGKLYKQGGGTtiVLSGNlSrTYAKGV^^ 

QNIiGAANGAVTLNGGGIiAANGDFTSNRQIiEIjTAGTUCAlDVAAGra 

AGDGTLAriAGANTYTGGTRLQGGWQVSRDmLGQAAGAVTFI^ 

IJJIKAGQIDTDRGTTLTWNGAIGGKGELRKQGAGTLVIXSGANTYQGDTRVE^ 

NLGQGAVHLHDSRIiATTGTFATSRRLELTGRGAVQAAAAATLDWGTVAGAGTL^^ 

TLVLAGDNQHAGGTEVRAGTLQVSRATl^GPGArAriENAAIA 

IDTAAGTTLGWEGAIGGTGSLHKKGEGKIiVLVKDNHHIXSGTTIHAGTIjQVSRDi^ 

SAVTLDGGAIiAVS AGFS SGRE I WGAGHGALS VTGGHTLQWQGQ VGGAGALTKTGDGTLV 

LEHDNTHAGGTRITGGVTiRVSRDENLGEAHGiVILTLDGGTLSa^^ 

IWADAATLDLQGDVAGAGRLVKEGAGTLALGGTOTYAGGTVVEAGTLRVARDAN^ 

LTLNNSRLHATAGFATGRDATLSGRASlDTDDRATIjQWRGTVNGAGRLV^ 

DNRYAGGTEVNAGTriQVARDANIiGAGDVALNGSSIiAATASPATART^ 

ATLDWNGLIJXSMALVKQGNGTIiAIJ^AANRYGGGTIVKAGAW 

GGALATTADIATGRAATLGAANGTIoDVAAGTRLDWNGAIGGAGALTKTGAGTI^^ 

HAGGTLVHGGTIoRlARDANLGAAGTAVTIJDGGTIiATTASLAPERALRVGARNGV^ 

TTLDWRGWAGAGKLTKAGPGTLVLSADNRHGGGTAVTGGTLQVSRDANL^^ 

GGTLIiSTASFASARVATLDAAGGTFVTRIXSTRLDiraGAIGGAGGLVKEGAGELRLGNA^ 

YQGPTRI AAGRLAVNGS IAS PVTVEQAGVIiGGTGRI VGDVANRGWAPGNS IGALTVAGN 

YAGTGGSLEVEAVIiGGDAAPADRLVIJDGGAASGVTPVVVKPQGGVGGLTI^ PWVAQG 

GATTAPGAPRIjAQPLVAGAYEYQLLRGAGDGAAAQAQDWYLRTSRVERDKAGRIVKVVPF 

YRPEVAIiYAGTPMIJyiRMVGTEAtiGSYRERAGQPGAAAPEAGAAARRGVWARTFGRRFERS 

AGSEAAPSFNGSIiAGMQLGADIiYTRRSATRHADAFGVFGGYATARGDVRGIiARGEJQAVG 

TSTLRAAQLGAYWTHTGPSGWYVDTVIiAGTRYKQQTSSSA^ 

WQIiNPRWQIEPQAQLVYQQLGIANGADRVSSVSYKTPDAIiTGRIjGTRIiAGQYAYGKAQLR 

PFMGVSLIiHDFTGAIDTVTFAGVHSVRT^RQNTAVDIjKAGVDTQLG^ 

GSGDGSDRGWSANXiGLRVAY 

SEQ ID NO:51 polynucleotide sequence of Orf26 

Atgttgcgaactggagccccgatgcgtagcgcccgccgccgtaccccagcacgacttgcc 
Ccgttgccggccatgctggccgccgccggcctgctgcaatccttgggcgcgacgcccgcg 
Gccgcggcctgtgcgcccacgctggcgccagccagcgggcagagcgtgcaatgcgacggc 
Gccgtggtcaaccagtcggtcgaggcggcggccggcagccagaacgtaaccatcacggtg 
Gcgcccggcgcgctgttctcgaccaacgcgacgcgggcgctgtcggtcgatgaccgcagc 
Cgtatcgtcaacgagggcacgatccagatggccggcggcgccggcgcctcgcgcggcgcc 
Atggtgggtttcggcgacaacaaccagttgatcaaccgcggatcgatcacgacatcgggc 
Agcggcgtgcgcggcatctcggtgcccaacgtcggcagtaccgggacgctggtcgacaac 
Agcggcagcatccgcacccagggcgccagcgcgcacggcatcgccatcaacggcccgggc 
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Aaccgcgtccagaacagcggcgccatcaccgtcaacggcaccgacgccaagggcgtgtat 
Ctgcaaggcggcagccccgcggccaacgtactggtcaacggcggcaccatccatgcgcgc 
Ggcgccagcagcaacggcatattcggccccgacggcgtgcatgtgaacaccaccaacgcc 
Aacggctttcatgcccgcgtcgagaacctgcccggcgggcgcatcctcagcgatcactcg 
Tatgcgctgcgcgggcagaacggcaacgataccttcatcaacgccggctacctgcaaggg 
Cacggcggggccggccgcgacacggccgtctacatgggcccccagggcacgggcacgctg 
Atcctgcgcaccggctcggccatcgccggcctggccgacggcggcggggcggccagccat 
Gcgtatctggagggcagcggcacggtggacaaccggttcgccaacttccgcaccctgacc 
Atgcgcggcgccgactggcgctggacctcggacgccgcgttcaccgaaagcgtggacctg 
Cgcaccggcacattctttcttgccggcacgctcgccagcccggccaaccgcctggccgcc 
Ggcgcggtgctggccggcaccggcacgctggccggcgcattgcgcaacgccggcgaaatc 
Cggcccggcccgaacgacggcagcggctacggcgcgctgacggtgcgcggcgattacacc 
Ggcgcgggcggcgcgctgcgcgtcaacacggtgctggccggcgacggggccgcctcggac 
Aggctggtcatcgatggcgggcacgccggcggcagcaccccggtcacggtggtcaaccgg 
Ggcgggcagggcgcgctgaccgcggccgacggcatcctggtggtccaggccatcaatggc 
Gcaagctcggacgccggcgccttctcgctggccgcccccctcaacgccggcgcatacgag 
Tacaggctgtaccgcggcggcgccacgggcgccgcgccggacagctggtacctgcgctcg 
Cgggcctatctggtcgaggaccaactggccggcagccttgccgaagccgaggcgatcgcc 
Gacgacatcggccggcgcaccggcgagcggccgagcatcgaggacacgccgctgtaccgg 
Cccgaggtggcgctgtacagcagcattcccatgctggcgcggcgcatgggcctggcccag 
Ctgggcaccttccacgaacggcagggcaaccaggcgctgctggcgcgcgacggcgaacgc 
Gtcgcggcctgggcgcgcgcctatggcggcaacagcaagcaggcgctcgacggcgatgcg 
Caacccggcatcgacgcccgcctggccggcgtgcaactggggcaggacctctacagcagc 
Gtgcgcccggacggcggacagcaccgcttcgggctgttcggcggctatggccaggcgcgc 
Ggcgacacccacggctcggccggcggcgagcgcgacgccgctaccggccggctgaccatc 
Gacggctacagcgtcggcggctactggacctatgtcggcccgcgcgggtggtacgtggat 
Gccgtgctggccaacacctggatggacatcgacaccgactccaaggccgggcgcgacgcc 
Gatacgcgcggccaggcgttcacggcttcgctggaaagtggctacccgctggcgctgtcc 
Gagcgctggacgctggagccgcaggcgcagctcatctaccagcacacgcgcgtcgacggt 
Ttctcggacgccgtgtccgaggtgcgcatccgcgacgacaacgcgctgaccgcccgcctg 
Ggcgcccggctgcagggcgagtacgcggccgccgcgcaggtgtggcgcccctacgcggcg 
Ctgaatttctggcgcaccttcagcggcgagaacaccgtcgtgctgggcgaagacagcatc 
Gatacccggcgcggcgcgacctcgctcgaactggcggccggcgccagcgtgacgctggcc 
Cgcagcctggccctctacggcaggctggcctatgccaccagcatcgacagccagtatctg 
Cgcggcgcttcggcgcagctggggatgcgctacacctggtaa 

SEQ ID NO:52 polypeptide sequence of Orf26 

MI^TGA^MRSARRRTPARLAPIiPAMIiAAAGIiLQSLGATPAAAAC^ 

AVVNQSVEAAAGSQNVTIWAPGALFSTiaAnyUJSVDDRSRIVNEGTIQ!^ 

MVGPGDmiQLINRGSITTSGSGVRGISVPNVGSTGTLVDNSGSIRTQGASAHGIAINGPG 

NRVQNSGAITVNGTDAKGVYIjQGGSPAANVLWGGTIHARGASSNGIFGPDGVHVI^^ 

NGFHARVBNLPGGRlLSDHSYALRGQNGNDTFINAGYLQGHGGAGRDTAVyMGPQGTGTIi 

ILRTGSAIAGIJADGGGAASHAYLEGSGTVDmFANFRTIJTmGADWR\^r^ 

RTGTFFtiAGTIiAS PANRLAAGAVLAGTGTLAGALRNAGEI RPGPNDGSGYGALTVRGDYT 

GAGGALRVNTVLAGDGAASDRLVIDGGHAGGSTPVIVV^ LWQAING 

ASSDAGAFSLAAPlJbJAGAYEYRLYRGGATGAAPDSTrajRSRAYIjVEDQLAGS 

DDIGRRTGERPSI EDTPLYRPEVAIiYSS I PMLARRMGLAQLGTPHERQGNQALLARDGER 

VAAWARAYGGNSKQALDGDAQPGIDARLAGVQLGQDLYSSVRPDGGQHRFGIiFGGYGQAR 

GDTHGSAGGERDAATGRLTIDGYSVGGYWTYVGPRGWYVDAVIjANTWMDIDTDSKAGRDA 

DTRGQAPTASIiESGYPLALSBRVmJEPQAQLIYQHTRVDGFSDAVSEVRIRDDWALTAI^ 

GARIiQGEYAAAAQVWRPYAAIiNPWRTFSGENTVVIiGEDSIDTRRGATSIiEI^ 

RSLAIiYGRIiAyATSIDSQYLRGASAQLGMRYTW 

SEQ ID NO:53 polynucleotide sequence of Orf27 

ATGCCTGCTCAACGC7^CTCCrCGCACCGCGGTTTGT(^ 

CGTTGGATCCATTGCACCGGATTCGTGCTGTGCGCGCTGCTGGCGGCATGCGGAGGCGGT 
GGCGGCGGAGGTGGCGGCGGAGG<:X3GTGGCGGCAGCCCGGGCGGCCGTGa3C(^ 
COBCAACCCGCCCCTTCGCCACGCCCCGAACCTGCACCCGAGCCGGCACCCAATCCTGCG 
CCCAGGCCTGCrCCGCAACax:CGGCK3CCaXX3CX^ 
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CCACCX3GAGGCTCCCCC!GCCCGTGATGCCGCCGCCGGCCGTGCCGCCrCAGCrr^ 
GTGCCXSGCCGCAGACCTGCCCCXSCGTGCGCGCGCCGCTGTaSACA 
CGCACCGACTTCGTCACGCCCT^CCXSGCGGGCCGTTCTTCGCCA^ 
AACACGATCGACCIXSAAGATGGCGCACGACCr^^ 

GCGGTCGTCGACGAAGGCGTGCGCAGCGACCATCCGCTCCTGAACGTCGAGAAGAAATAC 

GGTGGCGATTACATGGCCGACGGCACCCGCACCTACCCCGACCCCT^GCGCCAGGGCAGG 

CyVCGGAACCTCGGTCGCCCTGGTACnGGCCGGACAGGACACCGAC^ 

GTTGCGCCCAATGCCGACCTCTATTaaGCCAACATCGGCACGCGGGCC^ 

GACGAAGCCGCATTCGACGCCTGGAACGACCTGCTCGGGCACGGCATCTV^ 

AACAGTTTCGCCACCGAAGGTCCGGAAGGCGAGCAGCGCGTCAAGGAGGAC 

TACCATAGCGCCGCCAACAAGCAGAACACCTACATCGGACGGCTCQATCGCCTGGTGCGC 

GACGGCGCGCTCCTCATTTTCGCCGCCGGCAACGGO^GGCCATCGGGTCGCGCCTAC^ 

GAGGTCGGCTCGGTCGGACGCACCCCTCGCGTCGAGCCX3CACCTGCa\ACGCGGCCT 

GTGGTCACCGCGGTGGACGAAAACX3Ga\GGCTCGAAACATGG^^ 

GOGCAGO^TGGTGCCTGGCCGCGCCCAGCACCGCCTACCTGCCCG^ 

AACCCCGACAGCATCaVCGTCGAACAGGGCACGTaK:TATCCGCGCCGC^^ 

GCCGCCGTACTGGTGCAGGATCGCITrCGeiG^^ 

TTGCTGACCyVCGGCGCAGGACyu^GGGCCCGTACGGCGTCGACCCGCAGTACGGCTGGGGC 

GTGCTCGACGTGGGCCGCGCCGTGCAGGGCCCGGCGCAGTTCGCCTTCGG^ 

GCCCGGGTTACGGATACCTCCACGTTCGGCAACGACATCTCCGGCGCCGGCGGGCTGGTC 

GTCGACGGCCCCGGCGCGCTGGTCCTGGCCGGCTCCAATACCTATGCCXXXZCGCA 

ATCAAGCXSCGGCACCCTCGACGTCrTTGGCAGCGTCATOTCCGCCGT^ 

GGCGGCACGCTGACCGGCATCGGCACCGTCGGCACGGTGACCAACCAGGGC^ 

AAC^GGAGGCCGGCCTGCACGTCAAGGGAGATTACrCACAGACCXSC^ 

GTCACCGACATCGGCTCXSCTGCTCGACGTATCCGGCAGGGCCAGCCTGGCCGGC 

CATGTGGACGACATCCGCCCCGGCTACGTCGGCGGCGACGGGAAAAGCGTCCCGGTCATC 

AAGGCCGGCGCGGTGTCCGGCGTCTTCGCCACGCTGACGCGCAGTCCGGGCCTGCTGCTC 

AACGCCCGGCTGGACTACCGGCCCCAGGCCGTCTACCTGACCATGCGGCGCGCCGAGCGC 

GTCCATGCCGCGGCGCyVGCXSGGGCGaSKSACGACGGGCGTCGCGCGTCGGTGCTG^ 

GCCGAGCGGCTCGACGCCGCGATGCGCGTU^CTCGATGCCCTGCCCGAGTCGC^ 

GCCGCGGCXSCCGGOSGCCGCCATaSGAaSCATCa^GCGCGTGCAAAGC^ 

CAGGACAACCTGTATTOJCTGGCCGGCGCCACCTACGCC^ 

CTGGAGCAGAACCGCTGGATGGACCGCCTCGAGAACCACCTGGCCCAGGCCGGCGGCGAG 

CGCGTGGCGGCCATCGCCGAGTATCGCCACGGCCAGTTGCGCTGGCGCCCCGATGGCCTG 

CAAGGCCX5CCAGCGCGGCAACGGCATCATGCTGGGCCTGGCGCGCGAAGTCTCGGCCGGC 

CrrGAGCCTGGCOSCGGCGCTGACCCACAGCCGCACGCACTGGGACGAGTCGTCCGGCGCG 

CCGGCCOK^GACAACGCCGCCATGACCACGCCGGGCGTACTGCrrGGGC^ 

TGGGAGGACGGCTGGTTCGTGCAGGGCGCACTGGGCTATAGCCGCTACCGCAACCAGGCC 

ACGCGCCACATCTCGCTCGGCGATGCCGGCCACACCGTCGGCGCCACCGCCCGGGGCCAC 

GTCTGGCAGGCCGACGCCGGCCTGGGACGCCAGTGGACGCrCGCCCCCGGACACACGCT 

GCXSCCT(y2GGCGGGCCTGCPACTCACGCATCTGC^ 

GOSCy^GGACTGGGGCTGCXSCGCCCACGCCITGACGCGCaVCCGTGCCC^ 

CAACTGC»AAGCCGCCATGCCTTCATGCTGGGAGCCAC^ 

CTGGGCGTCTGGCATGACCTCCGaSCGCXSGaSCTACGCOT 

CTGGCGCAGGACCaVGGGCGCCAGCGGCTACTGGCCCGTGCCGCGCACACGCGTAC^ 

GOSCTCGGCCTGCGCGCCGAGTTCGCGCCAGGCCTCGTGCTGGGGCTGGGCTACACGGGC 

CS^GCTTGCCACGCTVCTGGGTCGATCACCAGCTCAGCGCCAGCCTCACTTACCC^ 

SEQ ID NO:54 polypeptide sequence of Orf27 

MPAQRTPRTAVCEATVRSSPRWIHCTGPVLCALLAACGGGGGGGGGGGGGGSPGGRAPSA 

PQPAPSPRPEPAPBPAPWPAPRPAPQPPAPAPGAPRPPAPPPEAPPPVMPPPAVPPQLPE 

VPAADLPRVRAPLSTYRRPQRTDFVTPTGGPFFAKQDKAIJNTIDLK^aAro 

AVVDEGWSDHPLIiNVEKKyGGDYMATCTRTYPDPKRQGRHGTSVAL 

VAPNADLYSANIGTRAGHVSDEAAFHAim^LIiGHGIKIFmSFATBGPEGEQRV^ 

YHSAAimSNTyiGRliDRIiVRDGAIjIiIFAAGNGRPSGRAYSSVGSVGRTPRVEPHIjQRGIjl 

VVTAVDENGRIiETWANRCGQAQQWCnJAAPSTAYLPGLDKDNPDSIHVEQGTSLSA^ 

AAVLVQDRFRWl^^^DNIiRTTLIiTTAQDKGPYGVDPQYGWGVIJDVGRAVQGPAQFAFGD^ 

ARVTDTSTFGNDISGAGGrlVVDGPGALVIJVGS^PIYAGRTOIKRGTIlDVFGSV^ 

GGTIiTGIGTVGTVTNQGTVVNKEAGIjKVKGDYSOTAQGIi^ 
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HVDDIRPGYVGGDGKSVPVIKAGAVSGVFATLTRSPGIiLIjNARLDyRPQAVYIiTMRiy^ 

VHAAAQRGADDGRRASVXiAVAERLDAAMREI^ALPESQRDAAAPJ^ 

QDNLYSLAGATYANAAAVOTLEQmWMDRLEN^^ 

QGRQRGNGIiVIIiGIiAREVS AGLSIiAAALTHSRTHWDES SGAPT^NAAMTTPGVLLGARRA 
WEDGWFVQGALGYSRYRNQATRHI SliGDAGHTVGATARGHVWQADAGLGRQWTIAPGHTIj 
APRAGLQLTHLRQQGFSESGAQGLGIJiAHALTRTVPTLWAQLQSRHAFMI^T^ 
LGVWHDLRARRYAASGGFAGIAQMGASGYWPVPRTRVQGALGLRAEFAPGLVLGIiGY^ 
QIATHWVDHQLSASIjTYRY 

SEQ ID NO:55 polynucleotide sequence of Orf28 

Atgtcgtccccgcgtccccccgcaccttggcgcgcaccgctcgcgcttgccggcctgtcg 
Cttggttgcgctgccggcgcatacggcgcgcccgcgccggcacaaaccgtcgtcaccctg 
Cccgcgcaagaggtcatcggcgacagcgtcgcggcggcccggtccgtgctgcgcctgccg 
gagatcgagcgcgcgcaggccgacaacttcgcctccctggtcgatcagctgccgggcatc 
tcgatggccggctctccgcgccccggcgggcaaagcctgaacatctggggcatgggcgat 
accgaggacgtgaaaatcgtcctcgatggcgcgcccaagggtttcgagaagtaccgccag 
ggctcggtcttcatcgaacccgaactgatccggcgcatcgaggtcgacaaggggccgcac 
aacctggtcgacggcaatggcgggttcggcggcaccgtcaagatcgataccaaggatgcg 
gccgacctgttgccgccgggcgcgcgcttcggcgcgctggccaagtacggccgccattcg 
aacgacggccaggacatctacagcgtggcgctgtacggccgcacccgcgccgacggggcc 
gacggcctgctgtatgccaaccgccgcgacggcggcgatctgcgccgccccgacggcacc 
cgcttcgcatactcgcgcaacaaccagcgctcgctgctcgccaaggtcaacctctatccg 
gacgacgcccagaccatcaccctgtcggccatgcgttcgaatgcggcgggttggcaaccc 
ttcgcggccaagcgcgacgatcttcccgcgccttcgcaggccgatatcgaccgctacggc 
ctgaccgaagcatggcggcgcaagctggtccatcgcgaccagctcgaccagaactacagc 
gcgaaatggaacatcgccccatccgcccatccctgggtgaacctcacgctggcctatgcc 
cgctcggacacccggcagcgcgaccggcgctcgtcccgggcgtcgcagtcggcctttctc 
ggcacgctgggcaacaagagttgggtcgactaccgcgacgaccggttcgacctcagcaac 
gaaagccacgtggccctgggcacggccgagcatgtcctgctggcgggcctgcgctggcac 
cggcatcgccgcgacacgctcatgtactacccgcccggccgcggcgagcccgattacaac 
cacgggtacttccagccgcactacatgccttcgggcacgcagaccgtgcgcagcctgtac 
ctgcaggacgccgtcaccgtcggcggccttaccgtcacgcccggcgtgcggtacgaccat 
gtcgccaataccggcaggccaaacgacgcgccccgctacaacaaccccgcccccgtggcc 
gggcatgactaccgccgcgtctcgtacgcgggctggaccccgcacctgggcgtggtctgg 
aaggcggcgcgaggcgtggcgctgttcgccgacgccggccgcacctggcgcgcacccgtc 
atcgacgaacagtacgaagtgcaatatgcgaagtccaatgtgtcgggcagcagccgggcg 
ctgcggcccgagcgcatcgtgggcctgcgcgccggcgccgtactggattacaacgatatc 
gcgacgcgcggcgacagcgtgcagatacggaccacgctgtttcgcaatcgcggcaagcac 
gagatcttccagcgccgtggcgtggcatgccgcgggcaggccgagggcggcgccgcctcg 
gactgccccaagcccttgtccaactaccgcaacctgcccggctacaccatcgaagggctg 
gaactggagacctactacgacagcccggcgatgttcgccagcctgtcgctttcggccatg 
cgcgggcaccgcgacgcctcgccgcgcgatccatgggggccgcgcacctggatcgccgag 
atcccgccggtctcggcgcgcgcgatgctgggcgtgaaactgccgcgcctggacatggtc 
ctgggatggcgcggcgaattcgtgcgccgccaggaccgctcgccgaccgacggcgacccg 
ctggccggctactgggccttgcccaagaccgccggctacgcgctgcacggcctgttcgca 
agctggcaaccccggcatgtcaaaggcctggacgtgcgcctggccgccgacaacctcttc 
aaccggccctatcatccctacctgggcgaagcggtatcgggcacgggccgcaacatcaag 
ctgagcatcgcccagcgcttctag 

SEQ ID NO:56 polypeptide sequence of Orf28 

MSSPRPPAPWRAPLAIiAGDSLGCAAGAYGAPAPAQTVVTLPAQEVIGDSVAAARS^^ 

EI^aiAQADNFASLVDQDPGISMAGSPRPGGQSLNIWGMGDTEDVKIVIiDGAPKGFEKYRQ 

GSWIEPEIjIRRIEVDKGPHNLVDGNGGFGGTVKIDTKDAADIjIjPPGARFGAIiAK^ 

^IDGQDIYSVALYGRTRADGADGIjriYANRRDGGDIiRRPIXSTRFAYSRNNQRSLIi^^ 

DDAQTITIiSAMRSNAAGWQPFAAKRDDLPAPSQADIDRYGLTEAWRRKLVHRDQLDQN^ 

AKWNIAPSAHPWVNLTIAYARSDTRQRDRRSSRASQSAFLGTLGNKSWVDYRDDRFDIj 

ESHVALGTAEHVLLAGLRWHRHRRDTLMYYPPGRGBPDYNHGYFQPHYMPSGTQTVRSLY 

LQDAVTVGGIiTVTPGVRYDHVAlsmSRPNDAPRYNNPAPVAGmYRR 

KAARGVAIiFADAGRTWRAPVIDEQYEVQYAKSNVSGSSRALRPERIVGLRAQAVinmroX 
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ATRGDSVQIRTTLPRNRGKHEIFQRRGVAOlGQAEGGAASDCPKPLSNyRNIiPGyTIBGL 
EIiETYYDSPAMFASLSIiSAMRGHRDASPRDPWGPRTWIAEIPPVSTUlAML^^ 
IiGWRGEFVRRQDRSPTIXSDPIiAGYWALPKTAGYMjHGLFASWQPRHVKGIiDV^^ 
NRPYHPYLGEAVSGTGRNIKLSIAQRF 

SEQ ID NO:57 polynucleotide sequence of Orf29 

Atgaaggcgcggcgcctggccatggcgggtctgtcgctggcgctcggcggctgctcgctg 
Tcgcagcagatgcaggccatgcgcgacgccgcgacgtccctgcgcgcacgcctgctcgaa 
Gggcagcaggccgtgggccgggccggcgagcggccggcgcgcgaagccgcccaggacgtc 
gcgcggccctggctggccgggcgcgcccagccgctggcacgcgaggtgctgctgccgccg 
gcgctgcgcgccgatgtcgatacgaccctgctgttcgcgggcaaggccacgctgcccgtg 
ctggccgagcgcctgcatcgcgccaccggcatcgccgtgcgcgtgcatcccgacgcgctg 
ctgccgcgcgccgccttcctgccgcgcctggcggggcaggccgagctggcggccgagcct 
cccgcccaggccgaactgcgggccgggccgcgtccgctggccgacacgctcgacgcgctg 
gccgcgcagctgtacgtgcactggcgctaccatcgcggcgccatcgagttctaccgcacc 
gaaacgcgggtcttcgatgtgcgcacgctggcgctggccgccagcgcgcaggctcggctg 
ggccgcgccggcagcggcgagacgggcagtttcgaccatgcctcgagcacggtgctcagc 
gccgacgccggcaaggcgctgcaggccgtgcgggaccgcgtcgccgctttcctgacgcgc 
gccggcgtcatcgccgagatcgaggcgggcggaagcacgctcgcggtcacggatacgccg 
gaggcgctcgcgcgcatcgaaaaatacctgcaaggcgagaaccgcgccctgacgcgccgg 
gtacgcctggtgttcgaagagctcacggtgcgcaccacggccgccgccgaaggcggcatc 
gattggcaggcggtctacgccagcgcgcgcgccgcggcgtcgtacgccatgcccggcggg 
gccggcgcggcaggcgcgctcggggcccgcgtgctggccgggccctggcgcgacgcgcgc 
gccctgatcgccgcgctgagcaccatgggagcggtactgcgccatcgcagcatacccatg 
ctgacgctgaaccggcgcgccgtcacccacgccgtgcgcaccacgttttcctacgtggac 
caggtgcagcgcctgagcccgaccgcggcggcgcccggtgggcgcgatgccgtgcccggg 
ctggcggtgcagcagaagcgcgagacggtgggcacgttcctcacgctgttgcccgaggcg 
cgcgatgacggccgcatcctgctctccatttcctatgacaacaccattgcccagccgctg 
cgcaccctgaccttcggcgagggcggccagcaagtgtcgctgcagcagatcgccatcgac 
ggcagcggcatcgtgcagcaggtcgagctgctgcccggccagcccgtcatcctgtcgggc 
ttcgaccacagcgaagaccaatacgaacgccaccgcctgtttcccgatgcgccgctcgcg 
gccggcgggcacgaccgcacggcgcgcgagcgggtcacgaccgtggtcatggtcaccgcg 
cagatcgacgagggttga 

SEQ ID NO:58 polypeptide sequence of OrQP 

MKARRLAMAGLSIiALGGCSLSQQlVfQAMRDAATSIiRARLLEG^ 

ARPWIiAGRAQPIiAREVLLPPALRADVDTTIiLFAGKATLPVLAERLHRATGIAWV^ 
LPRAAFLPRLAGQAELAAEPPAQAELRAGPRPIiADTLDAIiAAQLYVHWRYHRGAIEFYRT 
ETRVFDVRTIiAIiAAS AQARLGRAGSGETGS PDHAS STVLS ADAGKALQAVRDRVAAFLTR 
AGVIAEIEAGGSTIiAVTDTPEAIiARIEKYLQGENRALTRRVRLVFEELT^ 
DWQAVYAS ARAAAS YAMPGGAGTUVGAIiGARVLAGPWRDARAL I AAIjS™ I PM 

liTLNRRAVTHAWTTFSYVDQVQRLSPTAAAPGGRDAVPGIAVQQKRET^ 
RDDGRILLSISYDNTIAQPLRTIiTFGEGGQQVSLQQIAIDGSGIVQQVELLPGQPVIIiSG 
FDHSEDQYERHRIiFPDAPLAAGGHDRTARERVTTVVMVTAQIDEG 

SEQ ID NO:59 polynucleotide sequence of Orf30 

Gtgaccatgttcatccgctggctcattctctccgcqtgcctgctgctggccgcctgcagc 
Cgcgctcccgataccgagatcctgcagcgcgatgtcggccagaccctggccgccacgtac 
Ggcccggacctgttcgacatcgtcgcgctgcgccgcatgggctcggccaccgacagcacg 
gccccgccgggccagacgcgccgggtggtctattacgatgtggtgctgggcctgaagaag 
gacctcaccctgggcgcctgggaccagcccggcgccgccgcgctggtcagcctgctgggc 
gccgggccgcgcagcatctcgggggtgaaatccagcggcaatgccgccggcgaccagatc 
gtcgcccacgccagcgccatctaccagcgcgacgcagagcaatgggtgcacgtcgccccg 
gccagcttcacggccaccgaagcgccctcgctggacaccggcgcgccgccgccggtgacg 
cgccagctgctccagacgctggagcagatcacgcgttccgtgccctacagcgcctccagc 
accgcccagcacgtggtgcaacaggagctggagcgctcggtggcgcgcatcaatggccgg 
cttgcccgcctgcaaaagggctacccgctggcgaccggccccgacaagggcgagtacctg 
gcgttcggccaggcgctggccgcgatcgggcgcaacgagcaggtgcgcgtcattcccctc 
attaccggcggcagcgcggacaacatggccatgctgcgcagcggcgcggcggtggccgcc 
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ctgtcgcaggccgacatcgcgcaactggcctacgagggcaaggggccgttcgaaagccag 
ggaccgttctccgggttgcgcgcgctgggcagcctgtatccggagctggtgcacatcgtg 
gtgcgccagggcgatggcatcgccacggtgggcgcgctgcgcggcaagaagattgccctg 
ggcccgtcgggctcggcggtacgcaccacgctggagaccgtgctggcagcccatgggctg 
cagccggggcgcgactatgcagtcatcgacacgccggccgccgcggccctgccgcagctg 
agcgaaggacgggtcgacgcggtggcgcaggtcatcggtacgccggccgcgcccttgcgc 
gcggcgctgacccaggcgcgcctggcgctgctgccgctggaccgggctgcgatcgacaag 
ctggtgcaggccgatccgaccctgatggcgctggacatcccggccaacacctaccccagc 
caggccgcggccatccccacggtgggcatggcggcgctgctggtcaccacggccgatctg 
acgcgcgacgaggccgcgcatatggtggacgtggtataccgggccgggcaggacctgctg 
gccgccgggtccgcgcagggcgcgcaggtatccgcggccaacgccgggcgcggattgagc 
attcccctgcacgacggcgccgtggaagccttcgagaaactgggcgcgccgcccctgccc 
gagggcaggtag 

SEQ ID NO:60 polypeptide sequence of Orf30 

VTMFIRWLILSACLbliAACSRAPDTEILQRDVGQTLAATYGPDLFDIVALRRMGSATDST 
APPGQTRRVVYYDVVLGLKKDLTIiGAWDQPGAAALVSIiIjGAGPRS^ 

VAHASAIYQRDAEQWVHVAPASFTATEAPSLDTGAPPPVTRQIiLQTLEQITRSVPYSASS 

TAQHWQQELERSVARINGRIiARIJQKGYPIiATGPDKGEyLAFGQALAAIGR^^ 

ITGGSADNMAMLRSGAAVAALSQADIAQIiAYEGKGPFESQGPFSGLRALGSLYPELVHIV 

VRQGDGIATVGALRGKKIALGPSGSAWTTLETVIiAAHGLQPGRDYAVIDTPAAAALPQIj 

SEGRVDAVAQVIGTPAAPIjRAAIjTQARIaALLPLDRAAIDKLVQADPTIjMAIjDIPAOT 

QAAAIPTVGMAALLVTTADLTRDEAAHMVDVVYRAGQDLIiAAGSAQGAQVSAANAGRGIiS 

I PIiHDGAVEAFEKIiGAPPLPEGR 

SEQ ID NO:61 polynucleotide sequence of Orf31 

At gat ccgt at gc c t ggt 1 1 c cga 1 1 c t ccgt t ccgc cgcgccgccggc t ggccgt cgcg 
Gcgctgtgcgcggcgctgggcggctgtgcggtcgggcccgactaccagcgacccgccatc 
Gacgtgggggccgcctacaaggaggccgccgcgccgcagcccggctggacgcccgcgcag 
cccagcgacgagagcgcgcgcgggcaatggtggcaggtgtatggcgacccggtgctcgac 
ggcctggtgcagcaattgaaccagggcaactactccgtggcgcaggccgaggccaattat 
cgccaggcccaggcgctggtgcgcaatgcgcgcgccggcttcttccccaccataggcgcg 
ggcgccgacgtgacgcggtccggctcgggcggcggcagcggcgccggctcgaacggcagc 
tcggtcggcaaccagtactcgctcagtgggtcggtcagctgggaagtcgatgtgtggggc 
cgggtgcgccgcgaagtcgagtccagccgcgccgaggcgcaggccagcgcggcggacctg 
gccgtcacccgcctgagcgcgcaggccgccctggtgcagaactacctgcaattgcgcgtg 
ctcgacgagcagaaacgcctgctcgacgccacggtgctggcctacgagcgctcgctgcgc 
ctgacgcagaaccgctacgaagccggcgtggtgggcaagtccgacgtggcggtggcgcgc 
acccagctggagaacacgcgggcccagtccatcgacctggactggcagcgcggccagttc 
gagcacgccatcgcggtgctgatggggcaggcgccttcgcgcttcgccctgccggcgcag 
ccgttcgcgcagcaactgccggacatcccggcgggcctgccctcgcaactgctggagcgc 
cggcccgacgtggcggccgccgagcggcgcgcggccgccgccaatgcgcagatcggcgtg 
gcgcaggcggcctggttcccggacctgaccttgtcggccagcggcggttttcgcagcggc 
cagttcgccgagtggctgaccgcgccggcgcgcttctggaccctcggcccggcgctggcc 
at gacgc t gt t cgacggcggcgcgcgt t cggcgcgcgt cgagcaggcccgcgccgcc t at 
gacgcgcaggcggccgcctaccgccagagcgtgctgacggcgctgcgcgaggtggaggat 
tacctggtgcagctgcgcgtgatggagcacgagcagcaggtgcagcgcaatgcgctcgag 
tccgcgcgcgaatcgctgcgcctggcgcgcaaccagtacgagcaggggctgatcgactac 
etgacjcg tggcggtgctggaaaccaccgcgctgaacaccgagcgcaacgccatcagcctg 
ctgggcagccggctcaacgccagcgtgcagctgatcgcggcgctgggcggcgggtggcag 
ggct tgccggccgaggcggcggccagcgcggcggccgagccgt 7ccgcgccctag 

SEQ ID NO:62 polypeptide sequence of Ori31 

MiRMPGFRFSVPPRRRIiAVAALCAALGGCAVGPDYQRPAIDVGAAYKEAAAPQPGWTPAQ 
PSDESARGQWWQVYGDPVLDGIiVQQLNQGNYSVAQAEANYRQAQALVRNARAGFPPTIG^ 
GADVTRSGSGGGSGAGSNGSSVGNQYSXiSGSVSWEVDVWGRVRREVESSRAEAQASAADIi 
AVTRLSAQAAIiVQNYLQLRVIjDEQKRIiLDATVIiAYERSIiRIiTQlTO 

TQLENTRAQSIDIiDWQRGQFEHAIAVIjMGQAPSRFAIiPAQPFAQQIiPDIPAGIfPSQIiIiER 
RPDVAAABRRAAAANAQIGVAQAAHFPDIiTLSASGGFRSGQFAEWLTAPARFWTLGPAIiA 
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MTLFDGGARSARVEQARAAYDAQAAAYRQSVLTAIjREVEDYLVQIjRVME^ 
SARESIiRIiARNQYEQGLIDYLS VAVLETTAliNTERNAI SIiXiGSRXjNASVQLI AALGGGWQ 
GLPAEAAASAAAEPSAP 

SEQ n> NO:63 polynucleotide sequence of Orf32 

Atgacgcatcccgtcccgacgacctttgcacgtaccgccggcgcgctgcttgccgcgctg 
Gcgctggccggctgcgccgtggggccgcagtaccaggcgcccacgcccgcgccggtgaag 
Ctggccagccccgaacaggcgctgttctcggccgaccggttgcaacgcgaatggtggcgc 
cagttgcaggatgcccggctggacgcgttgatcggcctggcgctggcgcgcaacctcgat 
atcggcctggcgctggcgcgcaacctcgatatccgccaggcgcaggcgcgcctgcgcgaa 
gcgcgcgccgcgctcgacgaaaaggaactggaccgctggccgaccgtgaccgcggccggc 
ggctacacgcgcagcctgtcgcagatcaaccccggccccgaccagcgcaacctcgcgcaa 
agctaccgcgcgggcttcgacgcgacctgggaaatcgatttgttcggccgcctgcagcga 
cgggccgaggccgcggccgcgcgcgaccaggccgccgccgccgacctggcccagacgcgc 
ctggtggtggtggccgagctggcacgcaactatttcgagatgcgcggcgccgagcaacgg 
ctggccgtggcgcgcgccaacctcgccacccagcaggagacgctgcgcgtcaccgcggcg 
ctggtggaaaccggccgcggctatgccggcgacctggccagcgcacgggccgagctggcc 
ggcacgcgggcgctgctcgcgccgctggagacgcaacggcgcctggcccagtaccacatc 
gccgtcctggcggccatgcggccggccgagctgggcgagctgcggcaggagcagccgctg 
gcgccgctggccgcgcaattgcccatcggcgacgtggccatgctgctgcaacgccgcccc 
gacgtgcgcgccgccgagcgcctgctggccgccaccaacgccgacgtcggcgccatcacc 
gccgaactgtatccgcgcatcgacctgggcgggttcctcggtttcattgccttgcgcggc 
ggcgacctgggccaggccagcagcaaggccttcgcgctggcgccgacgatcagctggccg 
gcgttgcacctgggcagcgtccaggcgcagctgcgcgcgggccaggcccggcacgacgcg 
gcgcgggcgcgctacgaacaggtggcgctgcaggccatcgaggaagtggaaggcgcgttg 
acgcgctatggacagaaccagcagcggctgcgcgacctgcttgacagcgccacgcagagc 
cagcgcgccgccgacctggcgcaaacgcgctatcgtgaaggggccgcgccgtatttgacg 
gtgctggacgcgcagcgtactcttttgcgcgcacaggatgccgtggcgcaatccgagtcg 
gagtcctataccagcctggtcgcgctctacaaggccctgggcggaggctggaataccgac 
gccgccgcgcccgcccgttccgcccgcaccgccgccctgccggccagcccctga 

SEQ ID NO:64 polypeptide sequence of Orf32 

lOTHPVPTTFARTAGALIiAALAIiAGCAVGPQYQAPTPAPVKIiASPEQ^ 
QLQDARIiDAIjIGIJUjARNQ^IGIjAIjAiySJI^ 

GYTRSLSQINPGPIXilWIaAQSYRAGPDATWEIDLFGRIjQRRAEAAAT^RDQi^^ 

LVVVAELARNYPEMRGAEQRIiAVARANIiATQQETLRWAALVETGR 

GTIUVLLAPLETQRRIjAQYHIAVLAAl^PAELGEIJRQEQPLAPI^ 

DVRAAERLLAATNADVGAITAELYPRIDLGGFLGPIALRGGDLGQASSKAFAI^ 

ALHLGSVQAQLRAGQARHDAARARYEQVAIiQAIEEVEGALTRYGQNQQRLRDLLDSATQS 

QRAADIiAQTRYREGAAPYLTVLDAQRTLLRAQDAVAQSESESYTSLVAIiYKALGGGWl^ 

AAAPARSARTAALPAS P 

SEQ ID NO:65 polynucleotide sequence of Orf33 

Atgaaacctgtcgtcatgagaaccttgttgtcccttgccgtggccacggccctggccggc 
Tgctcgctggcgcccacctacgagcgcccgcaggcgccggtcgacgcggcctatccgtcc 
Ggcccggcctacggcgcgccgggccaggccgccgcgggcgcgccggccgccgccgacgtg 
ggctggcgcgacttcttcggcgacccgctgctgcaggagctgctggcgctgtcgctggcc 
aacaaccgcgacctgcgggtcgccgcgctcaacgtggaggcggcgcgcctcaacccgagc 
ggacaggccggcatcagccgcagctaccaggtcggtgccagcctgtcgacctgggagctg 
gacctgttcgggcgcatccgcagcctcagcgaacaggcgctgcagctctatctggcccag 
gacgaaacgcgcctggccacccagctgacgctggtggccgagaccgccaacgcctacccg 
accctgcgcgccgaccaggaactgctggcgctgacgcgccagacgctggcggcccagcag 
gagtcgtacaagctgacccgccagagctacgacctgggcgtggcgaccgagctggacctg 
agccaggccgagatttcgctgcgcaccgccgagcgcaatctgtcgcagtacacgcgcatg 
gcggcgcaggaccgcaacgcgctggtgctgctggtgggccagccgctgccfggccggcatc 
ggcgcgcagctggaccaggccgtggcgctgcccgacggcgtggtcctggccgacctgccg 
gcgggcctgccgtcggatctgctcgcgcgccggccggatatccgcgcggcggagcaccag 
ctgcaagccgccaacgccagcatcggcgcggcgcgcgcggcgttcttcccgcgcatcagc 
ctgaccggctcggccggcacggccagcgccagcctgggcggcctgttcgatgccgggtcg 
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99ggcctggagtttcgcgccgcagatcagcgtgccgatcttcgcgggcggggcgctgcgc 
gccagcctggacctggccaagatccagaaggacatcggcatcgcgcgctacgagcaggcc 
atccagagcgggttccgcgaggtctccgacgcgctggccggccgcggcacattgcaggag 
cagatccggtcgcaggaactgctggtgcaggccaaccagcgcgcctacgacctgtcgcag 
cagcgttaccagcagggcatcgacaactatctcagcgtgctggattcgcagcgttcgctg 
tatacggcgcagcagacgctggtcgagacgcggctggcgcgcctgtccaacctgatccag 

ctctacaaggcgctgggcggcggctggtccgagcgcacggtggcggcggcgcaggccggc 

tga 

SEQ ID NO:66 polypeptide sequence of Orf33 

MKPVVMRTIjIiSIiAVATAIiAGCSIiAPTYERPQAE>VDAAYPSG 

GWRDFFCaDPLLQEIjIiALSIiAimRDIjRVAAIiNVEAAM^ SRSYQVGASLSTWEb 

DLFGRIRSLSEQAIjQIiYIiAQDETRliATQLTLVAETANAyPTLRADQEIjI^ 

ESYKZiTRQSYDLGVATEIiDIjSQAElSriRTABRNnCjSQYTRMAAQD 

GAQUDQAVALPDGWLADLPAGLPSDLLARRPDlRAAEHQriQAANAS IGAARAAFFPRI S 
LTGSAGTASASIiGGLFDAGSGAWSPAPQISVPIFAGGALRASIiDIiAKIQKDIGIARYEQA 
IQSGFREVSDALAGRGTLQEQIRSQELLVQANQRAYDLSQQRYQQGIDNYLSVIiDSQRSli 
YTAQQTLVETRLARIjSNIjIQIiYKAEjGGGWSERTVAAAQAG 

SEQ ID NO:67 polynucleoticle sequence of 0]rf34 

Atgaaacagcataaggtcggcaggcactgggcaggatgggcgatggcgctggcgtgcctg 
Ggcgcggccgcgccgctggcggcgcagccggcggcaccagctggggccgcgcaggcgcgc 
Gaactgctgctggaggtcaagggccagcagccgttgcgcctggacgccgcgccatcgcgc 
gtggcgatcgccgatccgcaggtcgccgacgtcaaggtgctggcgcccggcgtgggccgc 
ccgggcgaggtgctgctgatcggccggcaggccggcaccaccgagctgcgggtctggagc 
cgcggctcgcgcgacccgcaggtctggaccgtgcgcgtgctgccgcaagtgcaggccgcg 
ctggcgcggcgcggcgtcggcggcggcgcgcaggtcgacatggctggcgacagcggcgtg 
gtcaccggcatggcgccctcggccgaggcgcatcgcggcgcggccgaggctgccgcggcc 
gccgcgggcggcaacgacaaggtggtcgacatgtcgcagatcaacaccagcggcgtggtg 
caggtggaagtgaaagtggtcgagctggcgcgctcggtcatgaaggatgtcgggatcaat 
ttcagggccgacagcggcccgtggtcgggcggcgtgtcgctgctgccggacctggccagc 
ggcggcatgttcggcatgctgtcctataccagccgcgatttcagcgcgtcgctggcgctg 
ctgcaaaacaacggcatggcgcgcgtcctggccgagccgacgctgctggccatgtcgggc 
Gagagcgccagcttcctggccggcggcgagattccgattccggtatcggccggcctgggt 
acgacctcggtgcagttcaagcccttcggcatcggcctgacggtcacgcccacggtcatc 
tcgcgcgagcgcatcgcgctgaaggtggcgcccgaagccagcgagctggactacgccaac 
ggcatttccagcatcgacagcaacaatcgcatcacggtgatcccggcgttgcgaacccgc 
aaggccgacaccatggtggagctgggcgatggcgagacattcgtcatcagcggcctggtt 
tcgcgccagaccaaggccagcgtcaacaaggtgccgctgttgggcgacctgcccatcatc 
ggggcgttcttccgcaacgtgcagtattcccaggaggatcgcgaattggtgatcgtggtc 
acgccgcgcctggttcgccccatcgcgcgcggtgtcacgctgcccttgccgggcgcgcgc 
caggaggt cagcgacgc t ggc 1 1 caacgcc tggggct at tacctgctgggt ccgatgagc 
ggccagcagatgccgggc tttt cacagtga 

SEQ ID NO:68 polypeptide sequence of Orf34 

MKQHKVGRHWAGWAMALACXGAAAPIiAAQPAAPAGAAQAREI,^ 

vaiadpqvadvkviiapgvgrpgevliiigrqagttelrwsrgsrdpqvwtvrviipqv^ 
larrgvgggaqvdmagdsgvvtgmapsaeahrgaaeaaaaaaggitokv^ 

QVEVKVVEIJARSVMKDVGINFRADSGPWSGGVSLLPDLASGG^^?GMIlS^ 
LQNNGMI^VIjAEPTLIiAMSGQSASFIAGGEIPIPVSAGLGTTSVQFKPFGIGLTVT^ 
SRERI ALKVAPEASELDYANGI S S IDSNNRITVI PALRTRKADTMVELGDGETFVI SGLV 
SRQTKASVNKVPIjIjGDIjPIIGAFFRNVQYSQEDRELVIVVTPRIiVRPIARGVTLPIjPGAR 
QEVSDAGFNAWGYYLIjGPMSGQQMPGFSQ 

SEQ ID NO:69 polynucleotide sequence of Orf35 

Atgaagcgacttctctgtctgtccctgctgtccgtattgctggcggcgtgcacgacccca 
Tcgcagattccgcccgagacggcgcccggcggcgtgccgccggcggccgaaggtccgctg 
Gtcgtgccgccgctgtcggcgctgtccgacaccccgccgcgcgcgctggccgggcgctac 
cagcgcgttgcctggaccgagctgcccaactgggagagcgacgacctgtcgcgctggtgg 
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ccgctgttcctgcgcaattgcaaaggcctgatgcggccgaccagcggtaacctggcggcg 
ccggcacgcgccacgccgcgcgcctggcagcccgtgtgcgcggcggcggtcgacccgtcc 
aaggcgccggccgccggcgacagcgcggcggtgcggcgcttcctgcagacctggctgcag 
ccctggcgcatcgccggcgccgacggccgtcccgccaccaataccgtcaccggctactac 
gagccgctggtgcgcggctcgcgccgccagggcggccgctaccagtggccgctgtatgcc 
gtgccggccgacctgctcgtcgtcgacctgggctcggtctatcccgacctgaccggcaag 
cgcgtgcgcggccggctcgacggccgccgggtcgtgccctacgacacgcgcgccgcgatc 
gaggcgggcgaccgcaagccgccggccatcgtctgggtggacgatccggtcgacaatttc 
ttcctgcaggtccaggggtcgggccgggtgcagctgaccgatggccccgaccgcggcacc 
acgatccgcgtcgcgtacgccgaccataacggccagccctatgcctccatcggccgctgg 
ctcatcgacaagggcgagctgcgcgccgaccaggcatcgatgcagaacatccgtgcctgg 
gcccaacgcaat ccct cgcgcgtgcaggaaatgct caacgccaacccggcggtggt ct t c 
1 1 ccgcgaagaggcggt ggt cgat ccggagcaagggcccaagggggc ct at ggc at c ccg 
ttggcgccgcagcgctcgatcgcggtcgacgccggtttcgtgccgctgggcacgccggtc 
tacctgtcgaccacgctgccggcctccgaccggcccctgcagcgcaccgtgttcgcgcag 
gacaccggcacggccattcgcggcgcggcgcgcgccgacttctattggggctacggcgag 
gaagccggccagcaggccgggcgcatgaagcagcgcggccagatgtggctgctgtggccc 
aagcaggccggggagccgtcggcgcgatga 

SEQ ID NO:70 polypeptide sequence of OrfSS 

MKRLIjCIjSIjLSVI.riAACrTPSQlPPETAPGGVPPi^GPLVVPPLSALSOT 

QRVAWTELPNWESDDLSRWWPLFLI^CKGLMRPTSG^n^AAPARATPRAWQPVC^^ 

KAPAAGDSAAVRRFLQTWLQPWRIAGADGRPATOTVTGYYEPLWGSRRQGGRYQWP^^ 

VPADLLVVDLGSVYPDLTGKRVRGRLDGRRVVPYrrnUUVIEAGDRKPPAIV^^ 

FLQVQGSGRVQIiTDGPDRGTTIRVAYADHNGQPYASIGRWIilDKGELRADQASMQNIRAW 

AQRNPSRVQEMLNANPAWFFREEAWDPEQGPKGAYGI PliAPQRS I AVDAGFVPLGTPV 

YLSTTLPASDRPLQRTVFAQDTGTAIRGAARADFYWGYGEEAGQQAGRMKQRGQMWLLWP 

KQAGEPSAR 

SEQ ID NO:71 polynucleotide sequence of Orf36 

Atgttcaactgtcggcgattcctgcaaatcggcacgctgtcggccctgctggccggctgt 
Gccacctccagccaaacaccccaagcccagcatcttcccgcgcaggccgccacaggccag 
Gccgaccgcgtccgcatcggcccggacaaacccgtatcgagcgacgaaggccccgccacg 
ctgacgccgaccggcgaactgcggcccgacgtccgcgccttcgccgaacagctggcggcg 
cagcgcgagctgcccctgccgcaagtgctggccagcctggaaagcacgcgctacaacgcg 
accgtcgcccgcctcatcgccccgtccggcgcgtcgggcaagaaaatctggcgcagctgg 
ctgacctatcgcgggcgtttcgtcgaacccaagcgcatcgcctggggcgtggaattctgg 
aacgccaaccaggacctgctcaaccgcgccgcccagcgctacggcgtgccggcctcgatc 
atcgcctccatcatcggcgtggaaaccctgtatggccgcaacgtgggcaacttccgcgtg 
gtcgacgccctggcgacgctggcattcgactacctcgatcccgccaagcccgagcgcgcc 
gacatgttccgcggccagctcggcgacttcatcaccctggcgctgcaggacaagctggac 
cccgagacgcgcggctcgtacgccggcgccatcggcatgccgcaattcatgcccggcagc 
atcatgcgctatgcggtcgatggcgatgacgacggccacatcgacctgaccaacagcgtc 
gcggacgcggtcatgtcggtgggcaacttcctggtcgaacatggctggcagcgcggcctg 
ccggtgttcgcgccggtcgcgctgccggccgatccggcgccgctggtggccggcggcctt 
acgccgacgctggactggaacggcctgcaggccgccggcgcgcgcccggcggcgggcgcc 
ggacgcggcgcctggcaggagcaccccatgggcatcgtggacctggtcgaggaagcgcgc 
ggcaccgtgcaataccgtaccgccacgcccaatttctttgccctgacgcaatacaaccgc 
agct actt ct atgccacggcggtggccgacctggcggccgaactgcaggcccgcacgggc 
tattga 

SEQ ID NO:72 polypeptide sequence of Orf36 

MPNCRRFLQIGTLSALIiAGCATSSQTPQAQHLPAQAATGQADRVRIGPDKPVSSDEGPAT 
LTPTGELRPDVRAFAEQLAAQRELPLPQYIiASLESTRYNATVARLIAPSGASGKKIWRSW 
LTYRGRFVE PKRI AWGVEFWNANQDLLNRAAQRYGVPAS I IAS 1 1 GVETLYGRNVGNFRV 
VDAIiATIiAFDYIiDPAKPERADMPRGQIiGDFITnALQDKIjDPETRGSYAGAIGM 
IMRYAVDGDDDGHlDLTNSVADAVMSVGNFLVEHGWQRGIiPVFAPVALPADPAPLVAGGIi 
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TPTLDWNGIiQAAGARPAAGAGRGAWQEHPMGIVDLVEEARGTVQYRTATPNPFALTQY^ 
SYPYATAVADLAAELQARTGY 



SEQ ID NO:73 polynucleotide sequence of Orfi? 

Atgaaccatagactcatacgttgcctgagcatcgcgctgctggccctgctgtcgggctgc 
Agcattctctccgggtcgggcccgacgcgatcggccatcatggacggcgggtcgaccgac 
Gcgaccggcgccaagctcggctcctacgacctggtggacctgcgcgccgacaccattgcg 
ccctatgtgctggtcaaggcggtgtccaaggatggcgccacctcggacggctacgtgggc 
aatatgcgcgtgatgccgggcgatgtgctgcgcatcctggtagccgacagcatggagacc 
ggactgttcgcgccgctggccgccggcggcacggtgttcgaagccgtgcgggtcgcggcc 
gacggcagcatctcgctgccctatgcgggccgcctgaaagtgcagggcaagtcgctggcg 
cagatcgagcagctcgtcaagggcagcctgcgcaataccgcggcggtgcagccgcaggcc 
citggtggatctggccgacgaccgctccaattcggtgctggtggccggggcggtgccgcgc 
ccgggacgcttcggcggcaacaagggcccgctgacggcgctggatgcgatcacgcaggcg 
ggcggctcgaccctgccggcttaccaggccgacgtagtgatccggactggcagcaaggtg 
cagcgcattccttaccagcaattgctcaacggccgcaacgtggcggtggagccgcgctcc 
gaactggtggtcgaaccgaacctgaagcgtttcgtggcgatgggggcccttaccaagccg 
ggcctgcacgaactgccgtcgaaccagaccaatctgctcgacgccctgggcgtggccgga 
ggcctgaacgaccgcgcggccgacgccaccggggtattcgtttttcgcctggacggccgc 
aacgccgatggccgcccgcggcccacggtgttcaggctgaatatgcgcaatccggagtcc 
atgttcctggccaagcaattcgagctgctgccggaggacgfcggtgtatgtcagtaatgcg 
cccatgtacgaatgggaaaagatcattacgcctatcgtgcaggtcctgatcgtgggccaa 
cgcgtgggtacttactaa 

SEQ ID NO:74 polypeptide sequence of Orf37 

MljmRLIRCLSIALLALXiSGCSIIiSGSGPTRSAIMDGGSTO 

pyVIiVKAVSKDGATSDGYVGNMRVMPGDVLRILVADSMETGLFAPLAAG^ 

DGSISLPYAGRLKVQGKSLAQIEQLVKGSIiRNTAAVQPQAMVDIiADDRSNSVLVAGAVPR 

PGRFGGNKGPLTALDAITQAGGSTLPAYQADWIRTGSKVQRIPYQQIiIiNGRNVAVEPRS 

ELVVEPNIiKRFVAMGALTKPGr>HEnPSNQTNLtJDAr.GVAGGIi^ 

NADGRPRPTWRLimRNPESMPIiAKQFELIjPEDVVYVSNAPMYEWEKIITPIVQV^ 

RVGTY 

SEQ ID NO:75 polynucleotide sequence of OrG8 

Atgcaacgtctcatgcccatcctggtcggactgctcgtcgtcctggccgtcctgtcttca 
Tgcgtcttcgtggtccgcgagcgcgactacgccctggtgttctcgctgggcgaggtgcgc 
Caggtcatcagcgagcctggcctgtatttcaaggcgccgccgccgttccagaacgtcgtc 
acgctggacaagcgcatcctcaccatcgagtccagcgatgccgagcgcatccagacctcc 
gagaagaagaacctgctgatcgactcgtacgtcaagtggcgcatcgccgatccgcgcctg 
tactacgtgaccttcggcggcaacgagcgcgccgcccaggagcgtctgcaggcgcagatc 
cgcgacgcgctgaacgcggcggtcaacgtgcgcacggtcaaggacgtggtctcggccgag 
cgtgacaaggtcatggccgaaatcctcaccaacgtcgtcaagcgcgccgagccgctgggc 
gtgcaggtggtcgacgtgcgcctgcgccgcatcgagttcgcgcccgagatttccgagtcg 
gtctatcgccgcatggaagccgagcgcacccgcgtggccaacgagctgcgttcgatcggc 
gcggccgaaagcgagaagatccgcgccgaggccgaccgccagcgcgaggtcatcgtggcc 
caggcct a t gcgcgcgcccagggca t ca t gggcgagggcgacgcc caggc cggc age at c 
tacgcccaggccttcggccgcaataccgagttctacacctattacaagagcctggaagcc 
tatcgcgccgcgttcggcaaaaccggtgacgtattggtggtcgatccgacgtcggagttc 
ttccagttcttcaagaaccccggcaagggcgcggcgggcgccccggcaccggcgaattga 

SEQ-ID NO:76 polypeptide sequence of Orf38 

^XQRIl^<PILVGLLVVIJAVLSSCVFVVRBIa^YALV^ 

TLDKRILTIESSDAERIQTSEKKWIililDSYVKWRIADPRIiYYVTFGGNER 
RDALNAAVNVRTVKDWSAERDKVMAEILTNVVK^^ SES 
VYRRMEAERTRVANELRS IGAAESEKIRAEADRQREVI VAQAYARAQGIMGEGDAQAGS I 
YAQAFGRNTEPYTYYKSLEAYR7U\FGKTGDVIiVVDPTSBPFQPFKWPGKGAAGAPAPi^ 

SEQ1irNO:T7 polynucleotide sequence orOr£39 
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Ttgcccagggaggcaaccatgaaacccgtcatccagactttcctgcgcgccgccgccgtg 
Gccggcctggcgctgctggccggctgcgccggcgtcagcacgacgcagtccggcgcgatc 
Ggcgtggaccgcacccaatacatgtcgagcctggtgcccgagcaggcgctggtgcaggag 
gccgggcagcagtatgccgagatcgtccaggaggcccgcgccaaggggctgcttgaccgc 
gacccggcgcaattgtcgcgcgtgcgcgccatttcccagcgcctgatcgcgcagaccggg 
gtgtttcgcgccgacgcggccaactggccatgggaagtgcatgtgctgtcggtcgacgag 
gtcaacgcctggtgcatgcccggcggcaagattgccgtctacacgggcctgctcgcccat 
atcaagccgaccgacgacgaactggcggcggtgctgggccacgagatcgcgcatgcgttg 
cgcgagcacgcgcgcgagcgcgtctcgcagcagatggcgaccagcatcggcctgtcggtg 
ctgtccatggccaccggttcgcccggcgcgtccgacctgggcggcaagctgaccgaagtc 
atgttcaccttgcccaacagccgcacgcacgagaccgaggccgatcgcatgggcgtcgaa 
ctggccgcgcgcgccggtttcgatccgcgcgccgccgtcacgctgtggcagaaaatgggc 
gcggccgacggcaatgcgccgccggagttcctgtccacccacccgtcggccagtacccgc 
atcggcgaattgcagcaggccttgcagaaggtattgccgctgtacgagcaggcgcgcggc 
caggccgccaaatag 

SEQ ID NO:78 polypeptide sequence of Orf39 

IjPREATMKPVXQTFLRAAAVAGLALIAGCAGVSTTQSGAIGVDRTQYMS 

AGQQYAEIVQEARAKGLLDRDPAQLSRVRAISQRIiIAQTGVFRADTVANWPWEVHVIiSVDE 

VNAWCMPGGKTAVYTGLIjAHIKPTDDEIiAAVLGHEIAHALRE^^ 

LSMATGSPGASDLGGKLTKVMFTIiPNSRTHETEADRMGVELAARAGFDPRAA 

AADGNAPPEFLSTHPSASTRIGELQQAIiQKVLPLYEQARGQAAK 

SEQ ID NO:79 polynucleotide sequence of Orf40 

Gtgactcaccgtcccgctgcactctcgaagcccgcctcccgccgcggggtggccctgcgc 
Gcggcgatcgcgctgtcaaccattctgatcgtggccggctgcggctcgtcaagcaccaaa 
Tacgacaagaccgcgggctggagcgccgaacagttgtacgccgacgccaagcaggaagtc 
gcggcgggcaactggaccgatgcccgggagcgcctgaccgccatcgaaagccgctacccg 
ttcggcacgtacgcccagcaggccctgatcgaactggcttacgtcaactggaaagacggc 
gagaacgaacaggcgctggccgccatcgaccgcttccagcagctctatcccaaccacccg 
ggcacggactacgtgctgtacctgaaggggctggtcaacttcacgccggccagcgccttc 
atgagcaacctgaccggccaggaccccgccgagcgcgatcccaagggcctgcgcgcgtcc 
tacgatgcgttcaacgaactggtccagcgcttccccaacagcaagtacacgcccgatgcg 
cagaagcgcatgacctggctggtcaacgccatcgccatgaacgaagtccacgtggcgcgc 
tactactacgagcggggcgcctacgtggcggccgccaaccgggcgcagaccgtgatcacc 
gatttcgagggggcccccgcctcggaagaagcgctctatatcatggtcgagtcgtatgac 
aagctgggaatgaccgaactgaagggcgacgccgaacgcgtgctcgaccagaactatccc 
aacagcaaattcaagacgcaaggcctgtcggccgacaagagctggtggaacccgttctcg 
tggcgctga 

SEQ ID NO:80 polypeptide sequence of Orf40 

VTHRPAALSKPASRRGVAIiRAAIAIiSTILIVAGCTSSSTKYDKTAGWSAEQLYADAKQEV 
AAGNWTDARBRLTAlESRYPFGTYAQQALIEIiAYVNWKDGENEQAL^ 
GTDYVIiYLKGLVNFTPASAFMSNLTGQDPAERDPKGLRASYDAPNELVQRFPNSKYTPDA 
QKRMTWIjVNAIAMNEVHVARYYYERG^ 

KLGiyrrELKGDAERVLDQNYPNSKFKTQGLSADKSWWNPPSWR 

SEQ ID NO:81 polynucleotide sequence of Orf41 

Ttgcccccacaggttgaccttgccatgacgaagcactctgccgctcgaatcgccaccatc 
Gccgccgcaggcgtcctgctggccggctgcgcagcgcccaagaaccccgatccgcgcgat 
Ccctgggaaggcttcaaccggggcgtctacaagttcaacgacacggtcgaccgcgcgctg 
ttcaagccggtggcccaggcctataccttcgtcaccccgcagccggtgcgcagctgcgtg 
cacaatatgttcagcaacgtgggcgacctgtggtcggccaccaacagcttcctgcaaggc 
cgcgggcacgatttcgtcaacacgatcggccgcttcctgttcaataccaccatggggatc 
ggcggctgcttcgacgtcgcgtcgaccaccggggcgcgcaagatccccaacgacttcggc 
gtgacgctgggcgtctggggcttcggccagggaccgtacctggtgctgccgatctggggc 
gccagcagcctgcgcgacggcgtcggcctgatcggcgactggaccggcaaccagggcgcg 
accatcggcgcgatcgacaacgtgccgctgcgcaactcgctgtggggcctggaggccgtc 
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gacctgcgcgccagcctgctcgataccaccgacaccgtggaccgcgtggcgctggatccc 
tacagcttcgtgcgcgacgcctacctgcagcgccgcgccgccatggtgcgcggcaccaag 
acgggcgacgacacgctgcccacctatgaagacgagggcgatgacgacgcggcccccgcc 
gcgccggccgcccagccggccgcccagccgcagtaa 

SEQ ID NO:82 polypeptide sequence of Orf41 

LPPQVDLAIOTKHSATUIIATIAAAGVIjIjAGCAAPKNPDPRDPWEGFNRGV^ 

FKPVAQAYTFVTPQPVRSCraimFSWGDIiWSATNSFIiQGRGHDFV^ 

GGCFDVASTTGARKIPITOFGVTIiGVWGFGQGPYLVLPIWGASSLRDGVGLIGDWTGN^^ 

TIGAIDim>LRNSLWGLEAVDIjRASLIiDTTDTVDRVALDPYSFV^ 

TGDDTLPTYEDEGDDDAAPAAPAAQPAAQPQ 

SEQ ID NO:83 polynucleotide sequence of Orf42 

Atggcaacaaagtgcctgctccaggggagttttccggatgccagcccgataatgccggca 
Atgcgtagtggcgccgcatgggtgctggaagggaggtttatgcggtttggatggggattg 
Ccggcgctggccgtcgtgcttgcgctggccggatgcgtgaatcgcgagccagaggagcgc 
gcggccttcatcgcgtatctggaacaagtggccgcgccgcaggcgggcgtcgtggccgcg 
ccgcccgacccgcccacgcgcaaggccctgggcgactacgaggcgcagtacgagccgatg 
gaagcggcgcacgccgccgtgcgcgaagcgttggcggcgcagcaggcggcgctgcaggcg 
ctgcggctgcattcggtcgacgagatcgtcgcacgccaggacggctgggacaggctggcc 
gagcgcctggcggccgcgcgcaccgggctcgaacaggcgcgcgccgccgccgacgccgcg 
cgcgccgggatggagcagcctcccgacctgcgcaacgcctacgcgcgcgcctatgaacac 
agcgtcacggcgccggcacaggccttggcgcggatatccggcctgctcgaacccgccgtg 
gaggatgcgcggcgcgtggccgggttcgttgcgcgccatcgcgatcaggtcgataccgat 
ggtccgctgacccaggtgcgcgatccctcggtgcgcagcgagctcaatgtactgctgcag 
gcgctcaatggccgctccgaccaggtttcgcaggcgcaggccttgctcaatggcctggcg 
ggaccggctcgccaggcgccctga 

SEQ ID NO:84 polypeptide sequence of Orf42 

MATKCLIiQGSFPDAS PIMPAMRSGAAWVLEGRFMRPGWGLPALAVVIiAI^^ 

AAFIAYLEQYAAPQAGWAAPPDPPTRKALGDYEAQYEPIffiAAHT^VREALAA 

LRIiHSVDEIVARQDGWDRIiAERIAAARTGLEQARAAADAARAGMEQPPDLRNAYARAYEH 

SVTAPAQAIUARISGLLEPAVBDARRVAGFVARHRDQVDTIXSPLTQVRDPSVRSEr^^ 

ALNGRStX^VSQAQAUJNGIiAGPARQAP 

SEQ ID NO:85 polynucleotide sequence of Orf43 

Gtgatgctgaagaccgtattgcgcctgccggtctgcgccgcgctgctggcgctggccgcg 
Ggctgcgcgatgattccgcccgaaccggtggtgatctgtccgctgaccgcgccgcctccg 
Tcgccgccgcaaccctcggcgcggcccaacggctcgatctaccagccttcggcctacggc 
aactatccgctgttcgaggaccgccggccgcgcaacgtgggcgacatcgtcaccatcgtg 
ctggaggaaaagaccaacgccgccaagggcgtggccaccaataccagccgcgacggctcg 
gccacgctgggcgtggcggccgcgccgcgcttcatggacggcatcatcaacgacaagctg 
gataccgatatctcgggcggcaataccgccaacggcaccggcaagagcagcgccaacaac 
accttcaccggcaccatcacgaccaccgtgatcggggtgctgcccaacggcaatctgcag 
atcgccggcgagaagcagatcgccatcaaccgcggcagcgagtacgtgcgcttctcgggc 
ertggtcgacccgcgatcgatcaccggcagcaatacggtgtcgtcgacccgggtggccgac 
gcgcgcatcgaataccgcagcaagggcgtcatggacgaagtccagaccatgggctggctg 
caacgctttttcctgatcgcttcgccgttctga 

SEQ ID NO:86 polypeptide sequence of Orf43 

VMLKTVLRIiPVCAALIJUJUVGCAMIPPEPWICPIiTAPPPSPPQPSARPNGS^ 
irx'FLFEDRRFKIWGDIVTriVLEEKTNAAKGVATNTSiaDGSATI^^ 
DTDISGCasrTANGTGKSSANNTPTGTITTTVIGVLPNGN^ 
VVDPRSITGSNTVSSTRVADARIEYRSKGVMDBVQTMGWLQRFFLIASPF 

SEQ ID NO:87 polynucleotide sequence of Orf44 

Atgaagtcgtccctgtatcgaatcgcagcgctcagcgccgctgccctgttgctggccggc 
Tgcgccaaccagcgcgctccgaaggagtcgggcttcctcggcgattactcgcagttgcgc 
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Gaggagcaggtgcccggcggcgcgcggctgatctaccgcgacgccgcgctcaagccgcgc 
cagtacaccgccatgtggctgtcgccggtcgagtactaccccagcccgcaaccgtcggcg 
caggtgtcgatggaaacgctgaccgaactgcagaactacctggaccagtcgctgcgccgc 
aagatcggccgcgagatccgcctggtcaacggccccggcccgggcgtggccaaggcgcgc 
atcgcgatcacagcggrtcggcagcgaaagcgaggcgctggcggcctaccagtacatcccc 
9tggcgctggccgtcaccggcgccagggccgtgctggaaggcggccggccgcagcaggcc 
accatcgcgatcgaaagcaaggtcaccgacagccagacgggccagctgctgtgggcgtcg 
gtgcgcgggggcaccggcgagcgcgtacgcgccatcgcccagggccaggcctcggtgccg 
gcctcggcgctcaagccgctgatcgacgaatggaccgataacgtcgcacgtgaaatacgc 
aactacgtgcgcagcaaataa 

SEQ ID NO:88 polypeptide sequence of Orf44 

MKSSLYRIAAIiSAAAIiLIiAGCANQRAPKESGFLGDYSQLREEQVPGGARDIYRDA?^ 
QYTAMmiiSPVEYYPSPQPSAQVSMETLTELQlSrmDQSIiRRKIGREIRLW 
lAITAVGSESEAIiAAYQYIPVAIAVTGARAVIiEGGRPQQATIAIESKVTDSQTGQIiLWAS 
VRGGTGERVRAIAQGQASVPASAIiKPIilDEWTDNVAREIRKnrVRSK 

SEQ ID NO:89 polynucleotide sequence of Orf45 

Gtgaaccaacgtggggcccttttacccgttaacacgtgtgactctctttgcaaaggaact 
Atcatgaagtcgcgcattgccaaaagcctaaccatagctgcgctggccgccacgctggca 
Gcctgcasrttccgtccctctcgacgacaaggcaggtcaagctggaggctccggccagggt 
tcggcctccggccagatcctggatcccttcaacccgcaaagcattctggcgcaacagcgc 
t cggtgt ac 1 1 1 gac 1 1 cgacagct at acggt gt cggaacag t at cgcggcc tggt cgaa 
acccacgcccgctacctggcttcgaacaaccagcagcgcatcaagatcgaaggcaatacc 
gacgaacgcggcggcgccgagtacaacctcgcactgggccaacgccgtgccgacgctgtc 
cgtcgcatgatgaccctgctgggtgtgtcggacaaccagatcgaaaccattagtttcggc 
aaggaaaagccgaaggcgacgggttcgagcgaggctgatttcgccgagaaccgccgcgcc 
gat at cgt 1 1 at c agcgc t aa 

SEQ ID NO:90 polypeptide sequence of Orf45 

WQRGALLPVOTCDSIiCKGTIMKSRIAKSLTIAAIiT^TLAACSSVPLDDKA^ 
SASGQILDPFNPQS I LAQQRSVYFDFDSYTVSEQYRGLVBTHARYIiASNNQQRIKI EGNT 
DERGGAEyl^^^AIlGQ^UlADAVRR^!MTXlIJGVSDNQIETI SPGKEKPKATGSSEADFAENRRA 
DIVYQR 

SEQ ID NO:91 polynucleotide sequence of Orf46 

Gtgtccatgatcgcacgtatttccctgcggcctctgaaggggctcgcggtggctgtcctg 
Gcagcctccgccctgaccgcctgctcgtccggcaaatggggattcccctacaaggccggc 
Gtccagcaaggcaactggatcaccaaagagcaggtcgccctgctgcagcaaggcatgtcg 
cgcgaacaggtgcgcttcgccctgggcagccccacgctgaccagcgtgctgcacgccgat 
cgctgggattacccctactacttcaagcccggctacggcaaggcgcaggaacgccagttc 
accgtgtggttcgagaacgaccacctggtacgctggagcggggatgaacagcccgacctc 
cagccgttccagatcgagaaagtgaacgccaaacaggaagaaaaagccgacgcccaggtg 
gatacggccgagaagcgccaggaaggcatcgacaaggctgaaaaagtccggccccatgtc 
gatgtcacgacgccggacaaccccaccctcgactacccgggcgagccgggccaaaccttc 
gaaccgctcaagtaa 

SEQ ID NO:92 polypeptide sequence of Orf46 

VSMIARISXjRPLKGLAVAVLAASALTACSSGKWGFPYKAGVQQGIWITKEQVALXjQ 
REQVRFAIiGSPTIiTSVLHADRWDYPYYPKPGYGKAQERQPTVWPEiroHLWWSGDEQPDL 
QPFQIEKVNAKQEBKADAQVDTAEKRQEGIDKAEK\mPHVDWTPDNPTLDYPGEPGQT^ 
EPLK 

SEQ ID NO:93 polynucleotide sequence of Orf47 

Atggcgacccatcctgtcgggccaacgttgctggcggcgctgacgctgcttgccgcctgc 
Agcggttccatggcgcaagagccgccctacaagagcacgatactgggcttgcaggcgacc 
Atcctggacctgaagggcttgccgtccgacaccgacggcggcatatcggacctgagcgcc 
caagtgggtgcgctggccgcgcgccatgaaggcgtgtcggtacggcagggcaaggatgcc 
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gtcaccatcgccatgatgggcgacgtactcttcgatttcgacaaggccgacatactcgcc 
gcggccgaacccactctgcgggacatcgcggagctgatcaaatcccccgccaccggcatc 
gtcgccattgaaggtcacacggactccaagggctcggattcctataacaagggcctgtca 
ttgcgacgggcccaggccgttgcgcagtggctgggcgctcacggggtggatgcagcgaaa 
ctgtcggtcaggggcctgggggctgccaggcccgtacagcccaaccagctagctgtgaag 
attcaatag 

SEQ ID NO:94 polypeptide sequence of Orf47 

MATHPVGPTLLAALTLLAACSGSMAQEPPYKSTIIjGLQATIIiDLKGLPSDTDGGISD 
QVGAIiAARHEGVSVRQGKDAWIAMMGDVLFDPDKADILAAAEPTLRDIAELIKSPATC 
VAIEGHTDSKGSDSYNKGLSLI«?AQAVAQWI,GAHGVDAAKIjSVRGrjGAARPVQPN^ 
IQ 

SEQ ED NO:95 polynucleotide sequence of Orf48 

Atgaactatatgcattccccctctgtagttgccgggcgcgcccgccgcctgctggcggta 
Gcggcggttgccggctcggtggccgttctggccggctgcgccaatcccagcgcatcgagt 
Ggggtgtacaccftacggccaggcgcagcgcgagcagatcgtgcgcaccggcacggtcacc 
ggcgtgcgtccgattaccatccagaacgacaagtccagcggcgtcggcttggtggccggt 
ggcgcgctgggcggggtagcgggcaatgccgtcggcggcggcaccggccgcaccatcgcc 
acggtgggcggcgtcatcctcggcgcgctggcgggcaacgccatcgagaaccgcgcgggc 
aagtcctccggctacgaaatcacggtgcgcctggacaacggcgaaacccgggtcgtggcg 
caggaagccgacgtgcccatcagcgtgggccagcgcgtgcaggtcatcagcggcgcgggc 
ccgacGcgcgtgacaccgtattga 

SEQ ID NO:96 polypeptide sequence of Orf48 

lyamiHSPSWAGRARRLIjAVAAVAGSVAVIiAGCANPSAS 

GVRPITIQjmKSSGVGLVAGGALGGVAGNAVGGGTGRTIATVGGVIIjGAIiAGI^ 
KSSGYEITVRIiDNGETRWAQETUJVPI SVGQRVQVISGAGPTRVTPY 

SEQ ID NO:97 polynucleotide sequence of Orf49 

Ttggcgttgatcagcaaaaaggagcgcatcttgaaaaccctgctacccgtattggcgctt 
Gccgccctgctgtcggcctgcaacgcgaacgccccctcggatacgcccgagggcgcgccg 
Ccgcccgatacgcatacctcgcgcaattcgctggactggcaaggcacgtaccagggcgtg 
ctgccgtgcgccgactgccccggcatccgcacggtgctgaccctgcgcgccgacaacacc 
taccagttgcagacccagtacctggagcgccagccccgcccggacacggtgcaaggcaga 
ttcggctggctgacgggcgacaacgccatcgagctcgacagcgccggcgatcactaccgt 
taccaggtcggcgaaaaccggctgaccatgatgtcgcaagacggcaccctgcccagcggc 
ccgttggccgagcactacgtgctcaagcgcagccagtga 

SEQ ID NO:98 polypeptide sequence of Orf49 

IiALISKKERILKTIiliPVIiAIiAAIiLSACNANAPSOT 

LPCMCPGIRTVLTLRADNT^QLQTQYLERQPRPDTVQGRFGWIiTGD^^ 
YQVGENRLTMMSQDGTLPSGPLAEHYVLKRSQ 



SEQ ID NO:99 polynucleotide sequence of Or£50 

ATGAGACGGTTAAAGGCCCAGGCTTTCGAAGGCaiGCCGCAGCAGGCCGG^ 

CCTTGCTGGCGCTGGCCCTGGGGTTCCAGGGGGCGGCGGCGTGGGCCAATO 

CavCCACTTGCS^CCGCAGCCGGCGGAGCGCATCGaSCCAAGGTAGGGGG 

CACGTCACGGTGCAGGCCGGTGCGCGGATCGAGGCCGGCGACAGCGG^ 

GCCGAGTCCAGATCCAGGACGGCGCCGTCGTGO^GCACGGTCAATACTGCT 

CAAAACGCTGGAAGCAGCAAGCAATAACAATATTTCCATCCAAGTAGGCGCGCAG^ 

AGCGCTTCGCAGTCCAGCGCGTTGGGATTGTCAGGCGCCGGCAATACCXSTCACC^ 

GGGCCGATAATGCCGCGGCAATCnvSGATCACTGCC^ 

CGGGACTATCXSAAACAGTGCTCAATGGCGGCTAaSCCAACGCCATCGGCAGCAC^ 
ACGGGCGCTGGCOTGACXKJrAOGCSUVTCATGCC^^ 
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GCXSAOSACAGCGTCATACTCGACGGCGGCTCTACaVTCACa^ 
Cy^GCCTGACGCTGAAAGCCXKKXSACGGCACGCTGGGCCGCG^^ 
CAGGAGGCTGGAACOTGGACCCTGAATGGCCAGGTaKSCCGC^ 
AGGTGGAGGGaSGCACGCTGGTCTTGCGaS^^ 

GTCCGCCGGOSCTACGGCGGACGTAACTGCCGCCT^GCGCCy^TGCAGTCC^ 

(^GTTCACGCAGGACAGCAATGCCGCCTACGCCGGCGTGCTGAGCGGGACCGGGAGC^ 

GCGGCGGCGACCTGACGTTGACGGGCAACAACACCCATACCGGCAAGGTGGTGGTGG^ 

CAGCGTATCGGCXK5CCAACAACCTGGGTGGCGCAGGTAGTTCGGTAC^ 

CTCAAGAAAACCATCGCCGTCAATCGCGGCOTGACGCTCGATTCCGGGG 

CGGGAACAACCACGACCTGGCAAGGCCAGGTCAGTGGCXSCCGGCA^ 

GGTGCTGGAGC7^CGCGTCCAATACX3TATAGC:X5GCX3GTACGGAGATCAAC^ 

CATGACGCCAGCCTGGGTTCCGGCACGTTGGCGCTCAAGAACAGCCyVGCT 

CGGCCACGCGTGCATTGACXSCTCGCTGGAAACGAAAGCATAGACGTCGCAG^ 

GAACGGAGAAATCAGCGGCXSCCGGCACCCTGGTGAAGGa^GGCCAGGGGACCT^ 

AATCAGCAAAATGGCGGCACGACaSTCyVATGCCGGTACXK: 

GAGGGGC!GCTGGCGCTGAACGACGGCACGCTGCAGAGCACaKKaVGCl^ 

CTTGCHKXX3CCAGGCCACCATGQAGGTCGAa3CTTC^^ 

GGCGGCATGTTGCGCAAGTCAGGCCAGGGCACGCr^GGCCCTGGCCGGCGCa^ACAra 
CGGTGGTCGAGGCCXSGCGCXSCrTCXSGGCAGGACACGAAGA 

CAACCTGGGACGGGGCGCAATAACCCTGCAGGGCGGAGATCTGCTTGCCGGC^ 

CGCGATCTCACGCTTGTCCGCGGTTCCTTGGACGTGGCTCGCGACGCTACCCTGACCTGG 

TATCGGGCGCCGGCX3ATCTGGTCAAAACGGGGGACGGGACCCTGGCGCT 

CGGCCAGACCGTGCTCAGGCAAGGCAAGCTGCGCGTGGCCAGGGAAGAAAGCOTGG^ 

GTGCTGGAAAACAATAOSGTGTTCGAGAGTGCGGGCrCGTATGCCTVTCGGGCGGC^^ 

GCGCGCCCAAGGTGGCAAaX:CCGCGGG(:mc:ACGCTCGA^ 

GCTGTACAAGCAAGGCGGCGGCACGCTCGTGCTGAGCGGCAACAATACCT^^ 

TGGGGCGGGGTCGTGCAAGTCTCTCGCGACCAGAACCrGGGCGCGGCCAATGGCGCGGTC^ 

GCGGCGGGTTGGCGGCCAACGGGGATTTCACaVGCaATCGCCAGC^^ 

CATCGAOJTCGCGGCCGGCAAGGACGTGACGTGGCGCGGTGTCGTCAACGGCGCaSGC^ 

GCCGGCGACGGCACCmraAGGTTGGAGAGCGTGAAavCCTACACCGGTGGC^ 

CCGTGCAGGTATCGCGCXSACAACAACCTAGGCCa^GGCCGCCGGCGCGGTCACGTTCGACGGC 

GGCCAGCACCGGCAGCTTTGCGACCGCACXKX3CGGCCACGCT 

CAGGGCACCACGCTGACGTGGAACGGCGCCATTGGCGGCAAGGGCGAGCTGCGC^ 

CCCTGGTGCTGGGCGGOSCCAACACTTACCAGGGCGACACCCGCGTCGAGGCTGGC^ 

GGCCGACGCCAATCTGGGCCAGGGCGCCGTGCATCTGCACGACAGCCGGCTC^ 

GCGACCTCGCGCCGTCTGGAGTTGACCGGACGTGGCACGGTGCAAGCGGCTGCCGCCGCCACGCTGGATT 

GGCGCGGGACGGTGGCTGGCGCCGGCACGCI^TGAAGGAAGGCGCAGGCACG^ 

CAACCAGCATGCCGGCGGCACCCTGGTCCACGGCGGCACGCTGCGCATCGCCCGOT 

GCGGCGGGCACGGCGGTGACGCTGGACGGCGGCACGCTGGCCACCACGGCATC^ 

CGCTGCGCGTCGGGGCGCGCAATGGCGTATTGCnXSCCGGACGCGGGC^ 

GGTOSCCGGCGCGGGCAAGCrGACCAAGGCCGGTCCJGGGCATO 

GGCGGCGGCACGGCAGTaVCCGGCGGTAOSCTGCAAGTCTCGCGCGACGCC^ 

GCXSCCCTGACGCTGGACXSGCGGCACTTTGCTGAGCACCGC^ 

a3ATGC«3CGGGCGGCACCTTOGTCACCCGOGAOGGC^ 

GCGGGTGGCCTGGTCT^GGAGGGCGCCGGCGAGCTGCGGCTTGGCAATGCC^ 

CCCGCATCGCCGCCGGCCGCCTGGCCGTCAACGGCAGCATCX3CCAGCCCGGTCACGGTCGAGCAGGCTGG 

CGTGCTGGGCGGCACGGGCCGCATCGTCGGGGATGTGGCCAACCGCGGCGTGG 

ATCGGCGCGTTGACGGTAGCCGGCAATTACGCTGGTACCGGCGGCAGCCTGGAAGTGGAGGCGGTG 

GCGGCGACGCCGCGCCGGCCGATCGGCTGGTGCTTGACGGCGGCGCGGCCAGCXSGTGTC^ 

GGTCAAGCCGCAGGGCGGGGTGGGCGGCCTGACCCTGCGCGGCATTCCXSGTGGTCG 

GCCACGACCGCGCCCGGGGCCTTCCGCCTGGCGCAGCCGCTGGTCGCGGGCGCCTACGAGTACCAGTTGC 

TGCGCXXX:GCGGGCGACX3GCGCCGCGGCGCAGGC:X3CA.GGACT 

CGACAAGGCGGGCAGGATCGTCAAGGTCGTGCCCTTCTACCGGCCCXS;^^ 

CCGATGCTGATGCGCATGACAGGCACGGAAATGCTGGGCAGCTATCGa^ 

CGGTGTCGCCCGAAGCGGGCGCCACGGCCGCX3CGCGGTGGATGGGCX3CXK:AC€OTCG^ 

GCGTTCCGCCGGCGGCGAGGCCGCGCCX3TCCrTCGACGGCCATTTGGC 

CTCTACGOSCXKaiGCTCGGGCACGCGGCATACCGACJGCCTTCGGGGTC 

GC!GGCGACGTGCATGGCCTGGCX3C:X3TGGCX5AAATCCAGGCCGTGGG(^ 

ATTGGGCXK:crrATTGGACCCA.CACTGGTCCGGGCGGC^^ 

TACy^GGCAGCAGACGAAGTCGTCCXKn*CAGGTCGGCGCTGTCAGCCGCC^^ 
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TGGAGGCX3K3GCrATCCGaX3GCAGCTCAA(^ 
GC3UVCTXKK3CATTGCCaATGGax:CC3AC^ 

GCTCGGTTAGGTACGCGCCTGTCGGGCCAGTACGCATACGGC3AAGGCGC^ 
TATCGCTGCTGCAa3ATTTCACa2!GC!GCGGAa^ 

CCGCCAGAACACGGCCGTGGATCTGAAGGCGGGCGTGGACACGCAGCTGG^ 

GGGCAGGTAGGCTACGGCAAGTCGGTCGGCAGCGGCGACGGCAGCGACCGTGG^ 

GGCTGCGCGTGGCGTATTGA 



S£Q ID NO:100 polypeptide sequence of OrfSO 

I^^RLKAQAFSGSRSRPAC^GVAPTI^IiAIiAIiGFQGAAAWANCT 

VGGGSTGNNQHVTVQAGARIEAGDSGAISVGNNSRVQIQDGAWQSTVNTAASGQYAKTL 

EAASNNNISIQVGAQLIjAKGSASQSSAIiGIiSGAGNTVTNHGTl 

AKrriDNYGTIETVLNGGYANAIGSTRNNSATGAGVTVRl^^ 

DGGSTITGSLNGGSGimSLTLKAGDGTLGRAIRNFGTITKQEAGTWrijNGQVGROT 

STVKVEGGTLVLRGDNSGATQGGVLQVS AGATADVTAAS AMQS I SNAGTVQFTQDSNAAY 

AGVLSGTGS I VKRGGGDIiTLTGlJiNTHTGKVVVEAGSLS VS AAN^ 

LKKTIAWRGLTLDSGAQTLIIEPGTTTTWQGQVSGAGKLVTQGGTLVLEHASOT 

EINNGTLRAAHDASLGSGTIiALKNSQIiAATDSFTATRALTIiAGNESIDVAATKIL 

ISGAGTIiVKEGQGTIiLLRGTNQQNGGTTVNAGTIiQISRDANLGRGALAIiN^ 

ATSRAATLRGQATMEVDASHTVTWNGEIiSGGGMLRKSGQGTLAIAGA^ 

LRAGHEDlSn^RGAITLQGGDLIiAGGSFSSNRDLTIiWGSIiDVARDATLTW 

VKTGDGTLALTGVNEyAGQTVLRQGKLRVAREESLGGAALVLENNT^ 

TLKGAPKVATPAGDTLEWRGTVDGDGKIiYKQGGGTLVIiSGNNTYAKGVEW 

QNLGAANGAVTIiNGGGIiAANGDFTSNRQLELTAGAKAIDVAAGKDVTWRGVVNGAG^ 

AGDGTLRLESVNTYTGGTRLQGGWQVSRDNNIiGQ^ 

XjNNAGQIDTAQGTTLTWNGAIGGKGEIiRKQGAGTLVIiGGANTYQG 

NLGQGAVHIiHDSRIATTGTFATSRRLELTGRGTVQAAAAATLDWRGTV^ 

TLVLAGDNQHAGGTLVHGGTLRIARDANIjGAAGTAVTLDGGTIATT^ 

NGVLLPDAGTTLDWRGWAGAGKLTKAGPGMLVLSADNRHGGGTAVTGGTLQVSRDA^^ 

AAAGALTLDGGTLLSTASFASARAATLDAAGGTPVTRDGTRLDWDGAIGGAGGLVKEGAG 

ELRLGNAOTYQGPTRIAAGRLAWGSIASPVTVEQAGVLGGTGRIVGDVANRGWAPG^ 

IGALTVAGNYAGTGGSLEVEAVLGGDAAPADRLVLDGGAASGVTPVVVKPQGGVG^ 

GIPVWAQGGATTAPGAFRIiAQPLVAGAYEYQLLRGAGDGAAAQAQDWYIiRTSRVERDKA 

GRI VKVVPFYRPEVAL YAGTPMLMRMTGTEMLGS YRERAGQTGAVS PEAGATAARGGWAR 

TFGRRFERSAGGEAAPSPIXSHLAGAQLGADIjYARSSGTRHTDAFGVFGGYATVRGDVHGL 

argeiqavgtstlratqiigaywthtgpggwyidtviiagtryrqqtkssaqvgavsrgw^ 
tasveagypwqlnprwriepqaqvvyqqlgiangadrvstvsyktpdaltarlgtrlsgq 
yaygkaqlrpfmgvsliihdftgadtvtfagahgvrasrqotavdiikagvot 

GQVGYGKSVGSGDGSDRGWSANLGIiRVAY 



SEQ ID NO:101 polynucleotide sequence of OrfSl 

atgaacaaaccctcg?\aattcgctctggcx3ctcgccrtcgccgct 

cgcaaaccgtggacaactggcgcaatccgtttggcgacgtttggaaqaac^ 

gcgcgatgcgttctggaccccggctaccggcatccccggttgosacggcgttccggtcgctca 

AAGxCCGCTCX;CATGGCCGCCAAGGTCGTGTTCAATGCTGAC^ 

TGAAGCCGGAAGGCCGCCAGCTGCTGGATCAAGTCGCCCAGCAAGCCGGi^CGATCGATCTC^ 

CATCGCOGTTGGCCACACGGACTCGATCGGCACCXSJ^ 

GCTGCGGTCAAGACCTACCrrGGTCAGCAAGGGTATCGACCCCA^ 

AKCTGCAACOSATCXSCrTCXSAACAAGACGCGTGAAGGCCGTC 

CCTCGGTAGCCGCAAGAACTAA 



SEQ-ID-NOrl02 polypepUde sequence of OrfSl 
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MNKPSKFAIJOiAFAAVTASGAASAQTVDmRNPFGDVW^ 

CTK5VPVAQKEKSAPMAAKVVFNADTFFDFDKSTLKPEGRQIi^ 

GHTDSIGTEAYNQKLSERRAAAVKTfliVSKGIDPNRIYTEGKGELQPIASJ^^ 

RRVEIEIVGSRKN 



SEQ ID NO:103 polynucleotide sequence of Orf52 

ATGAACAAACCCTCCAAATTCGCrCTGGCGCTCGCCTTCGCCGCTGTTAC^ 

CGCAAACCGTGGACAACrGGCGCAATCCGTTTGGCGAC^ 

GCGCGATGCGTTCTGGACCCCGGCTACCGGCATCCCXJGGTTGC^^ 

AAGCCCGCTCCCATGGCOKia^GGTCGTGTTaATGCn^a^C 

TGAAGCCGGAAGGCOK^CAGCTGCTGGATCAAGTCGCCCAGCa^AGCCGGC^^ 

CATCGCCGTTGGCCACACGGACTCQATCGGCACCGAAGCCTACAACC^^ 

GCTGCGGTCAAGACCTACCrGGTCAGCAAGGGTATCGACCCCAACCGTATCTAC^^ 

AACTGCAACCGATCGCTTCGAACAAGACXK^GTGAAGGCCGTGCCCAGA^^ 

CGTCGGTAGCCGCAAGAACTAA 



SEQ ID NO: 104 polypeptide sequence of OrfS2 

MNKPSKFAIiALAFAAVTASGAASAQTVDNVmNPF^^ 

CDGVPVAQKEKPAPMAAKVVPNADTFFDFDKSTLKPEGRQLLDQVAQQAGTIDriETIIAV 

GHTDSICTEAYNQIOiSEiay^VKTYLVSKGIDPimiYTEGKGEIjQPIASlS^ 

RRVEIEIVGSRKN 



SEQ ID NO:105 polynucleotide sequence of Orf53 

ATGAAGTTCTACCCTTCCCATCCGATGCCCGAGTCGCTCGCGGCTGCGATCGCAGTQCCT^ 

TGCTGCCGGCGGCGCAGGCCGCGTCCACGGCGGTCCAGCTGCCATCCGTCACGGTCGAGTO 

GTCCTATCAACCGGAAAGCGCCCAGTCGCCCAAGTTCACCGCGCCCCTGGCGGACACGCCGOT 

CAGGTCATCCCTGAGCGGCTCT^TCCAGGACCAGGGGGCCAGCGACCTCGAAGCGGTACT^ 

CAGGGATATCGATGACCGCCGGCGAAGGCGGCCGTCCGGCCAGCGACCTGCCGTTCATCCGCGG 

TTCGGCGAGCAGCCTGTTTGTCGACGGCCTGCGOSATCCCAGCACGCAATCGCGC^^ 

GAACAGGTCGACXSTCGTCAAGGGGCCCGATTCGGTATTTTCCGGGCGCGGCGGCGCCGGCG^ 

ACCTCGTCACCAAGACGCCCAGGAACCAGGATTTCACCGAAGTCCAGGCCGGC^^ 

CTACOSAGGCACCATAGACXSGCAACTGGGTGCTGGGCGAGAACACGGCGCT^ 

ACCAGGGGCACCGTGCCGGGCCGCGACAAGGCGGTCGAGTTCAGCCGTO^ 

GCCTGGGCCTGAGCGGCCCCACCCXSCGTGAaSCTGGGCCTGTACCTCTATCGCC^^ 

ttattcgattco3tacgatccxk:gc:accggcacgccxsatc^^ 

TTCTACGGCCTGGTGCAGOK^GACTCCGGCGATACCGAGGACTACGCaSCCACCGTC^ 
ACCTCGCCAATGGCTTCAAGGTGGAGAACCTGGCGOSCTACTCGCGCGCCA^ 
CATCCCCGAACTGAAAACCGCCGATCTGGCCAAAGGGCTGGTGTACCGC^ 
GTCAACGACAGTTTCGCCAACCGCACCGACCTGCGCXSGCACATTC^ 

TCGATCTGGGCGGGGAGTTCGCCACCAGCCGGCGCyVGTCGCGACCGCTAa^GCAGGAAATCCC^^^ 

CXX^CAGTCCTTQCTCGCCCGTGACGGGCGGCAACAATCCCGCCCTGTGCGCCT 

CCGCACGTGGATTTCCCGGGAACGGTGCGGCGCAACCATAACCCGGCCCGCTAC^ 

CCCTGTACGGTTTCGAO^CCATCGCCTTCGACGAGCAGTGGCAGCTGAATCTCXSGCCTGCGCT 

CTACAAGACCAGCGGACGCAACCTGCCGGTACGAGGCGCC^GCCGCCCGTCrACXS^ 

ACCGAGAACCTGTTCAACTACCAGCTCGGCCTGGTCTACAAGCCTCGTCCGGAC^ 

GCTACGGCACGGCGTCCACGCCGTCGGCCGTGTCCQACTACGCCCCGGCGGAC^ 

CCAGCAGTTCAAGCCGGAGCGCAGCGAGGTGATCGAGGTCGGGACCAAGT^^ 

CTGCI^TGAa3GGCGCCATGTTCC6CGAAACGa3CAAGAA»^ 

GCXSCACCGGCOSGCAAGAGCCGCGTCACaKKIATGGAG^ 

GGACGTCTACGGCGGCTACGCGCTGCTCGACAGCAAGCTGGTC^ 

GGCCAGCCGCTGCCCAGCGCGCCCCGGCACGCATTCAGCATCTGGAGC7VCCTACAA 

TGACCGTGGGGGCaK3CX3CGTTCTATCGCAGCAAGGT^ 
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TGGCAACGCAGATGCCGGCCACAACA^GGACGGCAO^^ 
TTCGACGCCATGGCGGCGTACCft 
ACAAGACCTATTACGCCJ^AGACC 
GACGTTCAAGCTGAGCTACTGA 



TTCGACGCO^TGGCGGCGTACCAGCTCAACAAGCACCOTACGGCCCS^GTrGAATO^ 
JJ^CCTATScGCCAAGACCTACCGCAGCa^TTAOKXSGOGCTGG^ 



SEQ ID NO:106 polypeptide sequence of Oif53 

MKFYPSHPMPBSIAAAIAVPLLGIiPAAQAASTAVQLPSVTVEGEYSSYQPESAQS^^^ 

akLjtprtvqviperliqdqgasdleavijinapgisotageg^ 
Sfvtc^pstqsiudtfhleqvdvvkqpdsvfsgrggaggsinlvtktprnqdftbv^^ 

GIGTAETYRGTiroiWVMBOTAIJRMJlJ^TRGTVPGRDK^^ 

TRWLGLYLYRHRRVPDYSIPYDPRTGTPITETIGVSRRNFYGLVQRDSGDTE^^^ 

WEHDIJUSGFKVEOTJUIYSRATVEQITTIPEIJCTAPIAKGLVYRNLBM^ 

Sfdtgqwwitfdlggefatsiuisrdrykqeipdaaspcspvtggijnpai.c^ 

PHVDFPGTWraNPARYHTDILSLYGFDTIAFDEQMQIiNLGUlWDHYKTSGR^^ 

Svyesaartdnlfnyqlglvykprpdgsvyasygtastpsavsdyapadnisg^^ 

KPERSEVIEVGTKWQVIJ3RRLI.VTGAMPRETRKNTSIEVAEGLRAPAGKSRVTGMEI,GVA 

SsSpISwSSiKLVRASHNSGAQGQPI.PSAPRHAFSIWST^P^^ 

FYRSKVYGNADAGHNKDGTPKARWPAYWRFI1AMAAYQLNKHI.TAQIJJVYN1.L^ 

TYRSHYAAIjGPGRSAMLTFKLSY 

SEQ ID NO:107 polynucleotide sequence of Orf54 

ATGAAAAAGACTCT^CTCGCTGCaSCCCTGCrCGCCGGTTTCGC^ 
?S5?SGTASGSTCATCGACACGGGTATCGQCTAa^a3ATGTCGAT^ 

cgccgSSSagcgacotcaagtacaaccaowkicgot 

cSSSSJSgcSgotSggaagatctcggtgaosgccto 

??S???SSaacgg?Iactcx3GCCcaa6Acggccgcctgttc^ 

AlrrrRARGCTGGGGCCGTOTGGACTTCGGTCGCC^ 

5iTC?5???SSSS?SSTSGCCAACATCGGCATGGGCATGAGCGa3ATGAAC^^ 

SSISSS^StotacSgaccccgtcgtacagcggcttccagttcggtatcggcm^ 

rSGScSSAGGATGCTGACGCCGTCAACCGCGTTGGCTTCG^ 

SJS^SJCTS^ACarcScGGCCCGCTGAACGTCGCTCTGTCGTACGACCAGCT^ 

AScSSGSSS?G^GACGCGACCCCGCGCAGCTACGGCCrCGGCGGTTCGTA^^ 

^S^ScTCGCTCTGGCCrACGCTCGCACGACCGACGGCTGGTrCGGTGGCCAAGGC^^ 

^g??SgctgcScgggcgacaagttcggcggcttcggcgtgaaca^^^ 

S^SS^SSSTcScCTGTCGGCCCCCATCGGCGGCGCCAGCyACGTGrrCG^^ 

S??SSc^agct^ccggcggc6acgagaagatgaacgtcttctcgct6ggctacaccta^^ 
tcSS^Sc^SStacgcctacggttcgtacgccaagaacttcgcgttcct^ 

CGACCGCK3T(XK3a5TCGGTATCC»TCACXX5CTTCTAA 

SEQ ID NO:108 polypeptide sequence of Orf54 

MIOCTLIAAAIJ^GFAGAAQAETSVTLYGIIDTGIGYiroVDFKVKGANADDSDPKYl^ 
GMINGVQNGSRWGLRGTEDIK5DGI/3AWQLESGFNSGNGNSAQDGRLFGRQATIGW 
^RliFGRQTNIASKYFGSIDPFGAGFGQANIGMGMSAMNTVRYDNMVMYQTPSYSGPQF 

GXGYSFSANDKDADAVNRVGFATADimiAITTGrJiyVNGPLNVA^ 

?^rreSYLGGSYDFEVVKIJ^YARlTDGWFGGQGYPVAVTI.PS^G^^ 

GPKft^YMVGI^IGGASNWGSWQMVDPKLTGGDEKMNVFSIK5YTYDI.SKRTOTjYAYG 

SYAXKPAPL-EESAKSTAVQVGIRHRF 

SEQ ID NO:109 polynucleotide sequence of OrBS 

^^^^^^^^^^^^^^^^^ 
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CGCCGACGGGAGKTGACTTCAAGTACAACCACAGCCXKOT 

CGCTGGGGTCTGCGTGGTACGGAAGATCTGGGTGACXSGCCTGCAAGC^ 

XaVGCTOSGCCAAaSGTAACTCGGCCCy^AGAa^ 

aagcgaaagctggggcostctggacttcxsgtcgcca;^ 

GATCCGTTCGGCGCTGGCTTCGGTCAAGCCy^CATCGGCATGGGCATGAG^ 

ACGACAACATGGTCATGTAC<^GACCCC6TCGTACAGCGGCTTCCAGTTCG^ 

CGCGAACGACAAGGACGCTGACGCCGTCAACCGCXSTTGGCTTCGCCACCGCCGAC^ 

acgaccggtctgcgctacgtgaacggcccxsctgaacgtcgctctgtcgtacgaccagctg^ 

Aa\ACC7^GCCCAAGACGAAGTTGACGCCACCCa3CGC^ 

agtcgtgaagctggctctggcctacgctcgcacgaccgacggctggttcggt 

GCTGTCACGCTGCCCTCGGGCGACAAGTTCGGCGGCTTCGGCGTGAACACCTTCGCTGACGGC^ 

CGAACTCCTACCTGTTGGGCCTGTCGGCTCCKIATCGGCXSGCGCCAGC^ 

GGTTGACCCCAGCAACGACAAGCTGACCGGCGGCGACQAGAAGATGAACGTC^ 

TACGACCTGTCCAAGCGCACCAACCTGTACGCCTACGGTTCGTAC^ 

ATGCOVAGTOSACCXSCTGTCGGCGTCXSGTATCCGTaVCCGCTTCTAA 



SEQ ID NO:110 polypeptide sequence of Orf55 

MKKTIxLAAAIiIiAGFAGAAQAETS VTIiYGI IDTGIGYNDVDFKVKGANADGSDFKYNHSRF 

GMINGVQNGSRWGLRGTEDLGDGIjQAVFQLESGFSSANGNSAQDGRIiFGRQATIGLQSES 

WGRLDFGRQTNIASKYFGSIDPFGAGFGQANIGMGMSAlXDSriWYDNMVl^ 

GIGYSFSANDKDADAVNRVGFATADNVRAITTGLRYVNGPLNVALS'^^ 

VDATPRSYGIGGSYDFEVVKLAIiAYARTTDGWFGGQGYPVAVTLPSGDKFGGFGW 

GFKANSYLLGLSAPIGGASNVFGSWQMVDPSNDKIjTGGDEKMNVFSLGYT^ 

AYGSYAKNPAFLEDAKSTAVGVGIRHRF 



BPP0452 : Auto transporter 

>BPP0452 B. parapertussis np_882803 
>BPP0452_n 

B. pertussis homologous Beguence : SEQXD50 in VB60452 
BPP3135 : QmpA 

>BPP3135 B. parapertussis np_885310 
>BPP3135_n 

>BP0943 B, pertussis 
>BP0943_n 

BPP3376 : Probable TonB-^ dependant receptor for 
iron transport 

>BPP3376 B, parapertussis np_885539 
>BPP3376 n 
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B, pertussis homologous sequence : SEQID14 in VB60452 

BPP3392 : Outer membrane porln protein precursor 

>BPF3392 B. parapertussis np_885555 
>BPP3392_n 

>BP0840 B. pertussis 
>BP0840 n 
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